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(57) Abstract 

The present invention relates to non-pathogenic strains of HTV-1 and to components, parts, fragments and derivatives thereof and 
to genetic sequences derived therefrom and their use in the development of diagnostic and therapeutic compositions for the treatment and 
prophylaxis of AIDS and AIDS-related disorders. The present invention also relates to a method for attenuating pathogenic strains of HTV-1 
by mutagenizing particular regions of the HIV-1 genome. 
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NON-PATHOGENIC STRAINS OF HIV-1 

5 The present invention relates to non-pathogenic strains of HIV-1 and to components, 
parts, fragments and derivatives thereof and to genetic sequences derived therefrom and 
their use in the development of diagnostic and therapeutic compositions for the treatment 
and prophylaxis of AIDS and AIDS-related disorders. The present invention also relates 
to a method for attenuating pathogenic strains of HIV-1 by mutagenizing particular 
10 regions of the HIV-1 genome. 

Bibliographic details of the publications referred to in this specification are collected at 
the end of the description. Sequence Identity Numbers (SEQ ID NOs.) for the 
nucleotide and amino acid sequences referred to in the specification are defined 
15 following the bibliography. 

Throughout this specification, unless the context requires otherwise, the word 
"comprise", or variations such as "comprises" or "comprising", will be understood to 
imply the inclusion of a stated element or integer or group of elements or integers but 
20 not the exclusion of any other element or integer or group of elements or integers. 

Genomic nucleotide sequences of HTV-1 strains referred to herein are represented by 
their corresponding DNA sequence. 

25 Exemplary viral isolates referred to herein as "CI 8" and "C98" were deposited at the 
PHLS Centre for Applied Microbiology and Research, European Collection of Animal 
Cell Cultures (ECACC), Division of Biologies, Porton Down, Salisbury, Wiltshire SP4 
OJG. C18 was deposited on 17 October, 1994 under Provisional Accession Number 
V94101706 and C98 was deposited on 31 October, 1994 under Provisional Accession 

30 Number V941031169. Viral isolate "C54" was deposited at ECACC on 14 February, 
1995 under Provisional Accession Number 
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A summary of particular deletion mutants of HIV-1 of the present invention referred to 
herein is given in Figure 1 1 . 



Acquired Immune Deficiency Syndrome (AIDS) and AIDS related disorders are the 
5 clinical result of infection by Human Immunodeficiency Virus type I (HIV-1) (Barre- 
Sinoussi et a/, 1983). Infection by HIV-1 is generally characterised by progressive 
immune system damage (Teeuwsen et al, 1990; Clerici et al, 1989) leading to 
opportunistic infections, malignancies or wasting syndrome that constitute clinically- 
defined AIDS (Busch et al, 1991; Klaslow et al, 1990). 

10 

The high mortality rate of individuals infected with HIV-1 together with the social and 
economic consequences of the continuing HIV-1 epidemic has created an urgent need 
for a safe and effective treatment and/or prophylaxis against the devastating effects of 
AIDS. However, despite over a decade of high level scientific research into the 

15 pathogenesis of HIV-1 and the clinical manifestations of the disease, together with a 
detailed molecular analysis of the virus, there has been little success in the development 
of an effective vaccine. To date, the most effective theriapy is treatment with zidovudine 
(AZT) which delays the onset of full blown AIDS and alleviates to some extent the 
symptoms of HIV-1 infection. However, AZT is not an innocuous compound and AZT, 

20 metabolic products thereof or impurities therein can cause a number of side effects 
which limit long term treatment with the drug. Furthermore, AZT resistant isolates have 
been reported during treatment. Clearly, therefore, a need exists to develop alternative 
strategies in preventing and treating HIV-1 infection. 

25 The initial phases of HTV-1 infection are summarised by Levy (1993) as involving 
attachment, fusion and nucleocapsid entry. These phases have been the traditional foci 
in research into development of antiviral strategies. The molecular events at the virus 
genomic level have also been the subject of intense scientific research with an aim being 
the development of a live attenuated vaccine as a possible approach for the treatment or 

30 prophylaxis of HIV-1 infection. 
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There is a high variable rate of progression from initial HTV-1 infection to AIDS which 
reflects a rapidly changing pathogen and variable immune response of the host to 
infection (Sheppard et al, 1993). With regards to the latter, HIV-1 can be considered 
5 as a heterogenous group of viruses differing at the genetic level with concomitant 
variable pathogenicity. For example, HTV-1 strains can differ in their capacity to kill 
cells. Furthermore, it appears that HTV-1 strains evolve in a host after infection and that 
the evolution varies depending on the tissues infected by the virus. The major sites in 
the genome apparently responsible for biological and pathological variation are the 
1 0 highly variable envelope region (Cheng-Mayer et al, 1 99 1 ; Shioda et al, 1 992; Hwang, 
et al 1991; Sullivan et al, 1993; Groenink et al, 1993) and the viral regulatory regions 
such as tat (Leguern et al, 1993). The genetic complexity of the HIV-1 group of viruses 
together with their variable pathogenicity, are major difficulties in the development of 
live vaccines, genetic vaccines or component vaccines. 

15 

Notwithstanding the highly pathogenic nature of HTV-1, there are some reports of long 
term survival of subjects infected with the virus (Learmont et al, 1992; Levy, 1993; 
Sheppard et al, 1993; Lifson et al 1991). It is not always clear, however, whether a 
benign course following HIV-1 infection is due to host factors, viral factors or other 
20 unknown factors. There are reports that most infected people have at least laboratory 
evidence of progressive immune system damage in the form of CD4+ cell loss (Lang 
et al, 1989) and defective immune responses (Clerici et al, 1989). 

Although simian monkeys have been used as an in vivo model for HIV and Simian 
25 Immunodeficiency Virus (SIV) infection, a major handicap in AIDS research has been 
the absence of suitable in vivo models to study the pathogenesis of the disease and, in 
particular, to study the viruses involved in benign infection. The need for a suitable in 
vivo model is heightened by the fact that results obtained in vitro cannot necessarily be 
extrapolated to what occurs in vivo. This was clearly observed by Mosier et al (1993) 
30 where conflicting results were obtained in animals compared to cell cultures. 
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Despite the absence of suitable in vivo models, considerable scientific research has been 
directed to attenuating HIV-1 strains by mutagenesis of the virus genome. Deletions in 
the we/gene have been implicated in attenuated strains of SI V and their use in providing 
5 protective effects in monkeys (Daniel et al, 1992). . However, there are conflicting 
reports on the possible negative influence the nef gene product has on the rate or extent 
of virus replication (Terwilliger e/ a/, 1986; Luciwe/a/, 1987; Niederman etal, 1989; 
Kim et al, 1989; Hammes et al 7 1989). In fact, Kim et al (1989) found that nef did not 
affect HIV-1 replication or HIV-1 long terminal repeat (LTR)-driven CAT expression. 

10 Kestler III et al (1991) found that the nef gene is required for full pathogenic potential 
in SIV. However, such is the complexity of the HIV-1 group of viruses and the 
variability of immune responses between individuals let alone different species, that it 
is far from clear whether nef deleted strains of HTV-1 would behave similarly to nef 
deleted strains of SIV-I. There is a need, therefore, in order to investigate the possibility 

15 of «e/deleted HIV-1 strain as a vaccine candidate, to identify individuals infected with 
such modified viruses. 

Learmont et al (1 992) reported that a cohort of five persons infected with blood products 
from a single HIV-1 infected donor have remained asymptomatic from up to about 10- 

20 14 years after infection. Subsequently, a sixth person has been identified as being part 
of the cohort. Both the donor and recipients were HTV-1 seropositive but with no 
indications of clinical symptoms of HIV-1 related disease and CD4+ cell number and 
p 2 -niicroglobulin levels have remained in the normal range. The identification of this 
cohort of benignly infected individuals provides a unique in vivo model in which the 

25 pathogenesis of HTV-1 infection can be studied at the clinical and molecular biological 
levels. 

However, it has not always possible using conventional isolation procedures to routinely 
and reproducibly isolate viral strains from the above mentioned donor or recipients 
30 which has frustrated efforts to investigate the cause of the asymptomatic individuals. 
In accordance with the present invention, methods have now been established to isolate 
viruses from the above individuals. It has been determined, in accordance with the 
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present invention, that the six individuals of the cohort are infected by non-pathogenic 
strains of HIV- 1. Furthermore, the non-pathogenic strains of HIV- 1 carry one or more 
nucleotide mutations. The non-pathogenic strains of the present invention enable the 
generation of a range of therapeutic, diagnostic and targeting agents against HIV-1 
5 infection. The present invention also enables the attenuation of previously pathogenic 
strains of HIV-1. 

Accordingly, one aspect of the present invention contemplates a non-pathogenic isolate 
of HIV-1 or a component, part, fragment or derivative thereof. 

10 

In a related embodiment, there is provided a novel isolate of HIV-1 or a component, 
part, fragment or derivative thereof wherein said HIV-1 isolate is capable of stimulating 
in a human or primate subject an immune response such as a humoral immune response 
to at least one HTV-1 glycoprotein such as but not limited to gp41-45, gp 120 and/or 
15 gpl60 while not substantially reducing in said human or primate subject proliferative 
responses and cytokine production to a mitogen, alloantigen and/or recall antigen 
compared to a healthy, non-infected human or primate subject. Preferably, the cytokine 
is IL-2. Preferably, the mitogen is ConA or PHA and the recall antigen is influenza or 
tetanus toxoid. Preferably, the HIV-1 isolate is non-pathogenic. 

20 

More particularly, the present invention relates to an isolated HTV-1 strain which: 

(i) is substantially non-pathogenic in human subjects; and 

(ii) carries one or more mutations in its genome resulting in the inability to direct 
synthesis of at least one pathogenic HIV-l-derived polypeptide or protein. 

25 

Even more particularly, the present invention provides an isolated HTV-1 strain which: 

(i) is substantially non-pathogenic in human subjects; and 

(ii) carries a mutation in the we/gene and/or a long terminal repeat (LTR) region or 
in a functionally equivalent location in the HIV-1 genome. 
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Still even more particularly, the present invention is directed to an isolated virus which: 
(i) has a genome which is capable of hybridising under medium stringency 
conditions to complementary nucleic acid from a pathogenic strain of HIV- 1; 
5 (ii) is substantially non-pathogenic in human subjects; 

(iii) carries one or more deletion mutations in a region of its genome corresponding 
to a «e/ gene in said pathogenic strain of HIV-1; and 

(iv) optionally carries a mutation in one or both LTR regions. 

10 In a related embodiment, there is provided an isolated virus which: 

(i) has a genome which is capable of hybridising under medium stringency 
conditions to complementary nucleic acid from a pathogenic strain of HIV-1; 

(ii) is substantially non-pathogenic in. human subjects; 

.(iii) carries one or more deletion mutations in an LTR region of its genome; and 
15 (iv) optionally carries a mutation in a region corresponding to a nef gene in said 
pathogenic strain of HTV-1 . 

In a further related embodiment, there is provided an isolated virus which: 

(i) has a genome which is capable of hybridising under medium stringency 
20 conditions to complementary nucleic acid from a pathogenic strain of HIV-1; 

(ii) is substantially non-pathogenic in human subjects; and 

(iii) carries one or more deletion mutations in a region of its genome corresponding 
to a region which contains we/coding sequences and LTR nucleotide sequences. 

25 In a particularly preferred embodiment, the present invention provides non-pathogenic 
HIV-1 isolate C18 deposited at the ECACC on 17 October, 1994 under Provisional 
Accession Number V94101706. 

In a related embodiment, the present invention provides non-pathogenic HTV-1 isolate 
30 C98 deposited at the ECACC on 3 1 October, 1 994 under Provisional Accession Number 
V941031169. 
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In another embodiment, the present invention provides non-pathogenic HIV-1 isolate 
C54 deposited at ECACC on 14 February, 1995 under Provisional Accession No. . 

Although pathogenicity is a relative term, it is used herein in relation to the capacity of 
5 a strain of HIV-1 to induce AIDS or AIDS-related disorders in an individual over time. 
Accordingly, a "non-pathogenic" strain of HIV-1 is a strain which, at the clinical level, 
does not lead to the development of AIDS, at least within the median time of 6-10 years 
following infection with HIV-1. At the laboratory level, a non-pathogenic strain of 
HIV-1 is considered not to alter CD4+ cell counts or p 2 -microglobulin concentrations. 

1 0 In addition, a non-pathogenic strain of HIV- 1 may not alter CD8+ and CD3+ cell counts 
and would not alter lymphocyte counts. CD4+:CD8+ ratios also remain unchanged 
relative to normal non-infected individuals. Furthermore, generally, a non-pathogenic 
strain of HIV-1 does not induce p24 antigenaemia. A non-pathogenic HTV-1 of the 
present invention is generally still infectious but individuals infected with the virus 

15 remain free of symptoms for at least 6-10 years after infection. 

A laboratory classified non-pathogenic strain of HIV-1 may be determined at any time 
after infection. The term "non-pathogenic" is not to be considered as a strain that is 
never pathogenic under any conditions as this might depend on the host individual, the 

20 level of immune responsiveness in that individual and the extent or otherwise of other, 
for example, immune comprising disorders. Accordingly, a "non-pathogenic" HTV-1 
isolate of the present invention may also be considered a "low virulent" strain of the 
virus. A non-pathogenic strain of HTV-1 as contemplated herein may be isolated from 
an asymptomatic individual or may be derived from a pathogenic strain by mutation. 

25 Although the present invention is not to be limited to any particular pathogenic strain 
of HIV-1, for reference purposes, an example of a pathogenic strain is HTV-1 NL4-3 
strain as described by Myers et al (1992). 

The non-pathogenic nature of the HIV-1 of the present invention is conveniently 
30 evidenced by the cohort of seven individuals comprising one donor and six recipients 
which have remained free of symptoms or signs of HTV-1 infection for greater than the 
median time of 6-10 years. However, the individuals of the cohort are seropositive for 
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HIV-1 following infection with the virus as determined by Western blot anal>\ 
genetic analysis (e.g. using PCR techniques). A seropositive individual is one $' 
reactivity to at least one HIV-1 glycoprotein (such as but not limited to gp 41 -45, < 
gp!60) and at least three other virus-specific bands. 

5 

In accordance with the present invention, a non-pathogenic HIV-1 isolate is also c. 
of HIV-1 which: 

(i) induces an immune response in a human or primate subject; and 

(ii) does not substantially reduce proliferative responses or cytokine product! . 
10 mitogen, alloantigen and/or recall antigen relative to a healthy, non-i 

subject. 

Preferably, the immune response such as to a glycoprotein, for example gp41-45, g h . 
and/or gpl60. Preferably, the cytokine monitored is an interleukin, such as I 
15 Preferably, the recall antigen is influenza or tetanus toxoid. A non-pathogenic F TT 
isolate is also one which: 

(iii) does not substantially alter proliferative responses or cytokine producti 
allorgeneic mononuclear cells. 

20 The genomes or complementary DNA therefrom of the non-pathogenic H3V-1 i?-' 
of the present invention are capable of hybridising under medium stringency condL 
to the corresponding genome or complementary DNA of a pathogenic strain of I 
(e.g. HIV-1 strain NL4-3). The ability to hybridise to a pathogenic strain of HIV-1 
applies to a comparison of the entire genome/complementary DNA of a non-patho? 

25 strain or a fragment which includes genetic material corresponding to a region 1 
genome 3' of the nef gene in a pathogenic strain of HTV-L 

For the purposes of reference only, a suitable genomic nucleotide sequence frc . 
pathogenic HIV-1 strain is set forth in SEQ ID NO: 1 from HIV-1 strain NL4-3 (M 
30 etal 1992): 

1 TGGAAGGGCTAATTTGGTCCCAAAAAAGACAAGAGATCCTTGATCTC 
5 1 ATCTACCACACACAAGGCTACTTCCCTGATTGGCAGAACTACACACCA' 
101 GCCAGGGATCAGATATCCACTGACCTTTGGATGGTGCTTCAAGTTAGT^ 
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151 CAGTTGAACCAGAGCAAGTAGAAGAGGCCAAATAAGGAGAGAAGAACAGC 
201 TTGTTACACCCTATGAGCCAGCATGGGATGGAGGACCCGGAGGGAGAAGT 
2 51 ATTAGTGTGGAAGTTTGACAGCCTCCTAGCATTTCGTCACATGGCCCGAG 
301 AGCTGCATCCGGAGTACTACAAAGACTGCTGACATCGAGCTTTCTACAAG 
5 351 GGACTTTCCGCTGGGGACTTTCCAGGGAGGTGTGGCCTGGGCGGGACTGG 

401 GGAGTGGCGAGCCCTCAGATGCTACATATAAGCAGCTGCTTTTTGCCTGT 
451 ACTGGGTCTCTCTGGTTAGACCAGATCTGAGCCTGGGAGCTCTCTGGCTA 
501 ACTAGGGAACCCACTGCTTAAGCCTCAATAAAGCTTGCCTTGAGTGCTCA 
551 AAGTAGTGTGTGCCCGTCTGTTGTGTGACTCTGGTAACTAGAGATCCCTC 
10 601 AGACCCTTTTAGTCAGTGTGGAAAATCTCTAGCAGTGGCGCCCGAACAGG 

651 GACTTGAAAGCGAAAGTAAAGCCAGAGGAGATCTCTCGACGCAGGACTCG 
701 GCTTGCTGAAGCGCGCACGGCAAGAGGCGAGGGGCGGCGACTGGTGAGTA 
751 CGCCAAAAATTTTGACTAGCGGAGGCTAGAAGGAGAGAGATGGGTGCGAG 
801 AGCGTCGGTATTAAGCGGGGGAGAATTAGATAAATGGGAAAAAATTCGGT 
15 851 TAAGGCCAGGGGGAAAGAAACAATATAAACTAAAACATATAGTATGGGCA 

901 AGCAGGGAGCTAGAACGATTCGCAGTTAATCCTGGCCTTTTAGAGACATC 
951 AGAAGGCTGTAGACAAATACTGGGACAGCTACAACCATCCCTTCAGACAG 
1001 GATCAGAAGAACTTAGATCATTATATAATACAATAGCAGTCCTCTATTGT 
1051 GTGCATCAAAGGATAGATGTAAAAGACACCAATO 
20 1101 AGAGGAAGAGCAAAACAAAAGTAAGAAAAAGGCACAGCAAGCAGCAGCTG 

1151 ACACAGGAAACAACAGCCAGGTCAGCCAAAATTACCCTATAGTGCAGAAC 
1201 CTCCAGGGGCAAATGGTACATCAGGCCATATCACCTAGAACTTTAAATGC 
1251 ATGGGTAAAAGTAGTAGAAGAGAAGGCTTTCAGCCCAGAAGTAATACCCA 
1301 TGTTTTCAGCATTATCAGAAGGAGCCACCCCACAAGATTTAAATACCATG 
25 1351 CTAAACACAGTGGGGGGACATCAAGCAGCCATGCAAATGTTAAAAGAGAC 

1401 CATCAATGAGGAAGCTGCAGAATGGGATAGATTGCATCCAGTGCATGCAG 
1451 GGCCTATTGCACCAGGCCAGATGAGAGAACCAAGGGGAAGTGACATAGCA 
1501 GGAACTACTAGTACCCTTCAGGAACAAATAGGATGGATGACACATAATCC 
1551 ACCTATCCCAGTAGGAGAAATCTATAAAAGATGGATAATCCTGGGATTAA 
30 1601 ATAAAATAGTAAGAATGTATAGCCCTACCAGCATTCTGGACATAAGACAA 

1651 GGACCAAAGGAACCCTTTAGAGACTATGTAGACCGATTCTATAT^AACTCT 
1701 AAGAGCCGAGCAAGCTTCACAAGAGGTAAAAAATTGGATGACAGAAACCT 
1751 TGTTGGTCCAAAATGCGAACCCAGATTGTAAGACTATTTTAAAAGCATTG 
1801 GGACCAGGAGCGACACTAGAAGAAATGATGACAGCATGTCAGGGAGTGGG 
35 1851 GGGACCCGGCCATAAAGCAAGAGTTTTGGCTGAAGCAATGAGCCAAGTiVA 

1901 CAAATCCAGCTACCATAATGATACAGAAAGGCAATTTTAGGAACCAAAGA 
1951 AAGACTGTTAAGTGTTTCAATTGTGGCAAAGAAGGGCACATAGCCAAAAA 
2001 TTGCAGGGCCCCTAGGAAAAAGGGCTGTTGGAAATGTGGAAAGGAAGGAC 
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2051 ACCAAATGAAAGATTGTACTGAGAGACAGGCTAATTTTTTAGGGAAGATC 
2101 TGGCCTTCCCACAAGGGAAGGCCAGGGAATTTTCTTCAGAGCAGACCAGA 
2151 GC(^CAGCCCCACCAGAAGAGAGCTTCAGGTTTGGGGAAGAGACAACAA 
2201 CTCCCTCTCAGAAGCAGGAGCCGATAGACAAGGAACTGTATCCTTTAGCT 
2251 TCCCTCAGATCACTCTTTGGCAGCGACCCCTCGTCACAATAAAGATAGGG 
2301 GGGCAATTAAAGGAAGCTCTATTAGATACAGGAGCAGATGATACAGTATT 
2351 AGAAGAAATGAATTTGCCAGGAAGATGGAAACCAAAAATGATAGGGGGAA 
2401 TTGGAGGTTTTATCAAAGTAGGACAGTATGATCAGATACTCATAGAAATC 
2451 TGCGGACATAAAGCTATAGGTACAGTATTAGTAGGACCTACACCTGTCAA 
2501 CATAATTGGAAGAAATCTGTTGACTCAGATTGGCTGCACTTTAAATTTTC 
2551 CCATTAGTCCTATTGAGACTGTACCAGTAAAATTAAAGCCAGGAATGGAT 
2601 GGCCCAAAAGTTAAACAATGGCCATTGACAGAAGAAAAAATAAAAGC^TT 
2651 AGTAGAAATTTGTACAGAAATGGAAAAGGAAGGAAAAATTTCAAAAATTG 
2701 GGCCTGAAAATCCATACAATACTCCAGTATTTGCCATAAAGAAA7^AAGAC 
275 1 AGTACTAAATGGAGAAAATTAGTAGATTTCAGAGAACTTAATAAGAGAAC 
2801 TCAAGATTTCTGGGAAGTTCAATTAGGAATACCACATCCTGCAGGGTTAA 
2851 AACAGAAAAAATCAGTAACAGTACTGGATGTGGGCGATGCATATTTTTCA 
2901 GTTCCCTTAGATAAAGACTTCAGGAAGTATACTGCATTTACCATACCTAG 
2951 TATAAACAATGAGACACCAGGGATTAGATATCAGTACAATGTGCTTCCAC 
; 3001 AGGGATGGAAAGGATCACCAGCAATATTCC^ 
3051 TTAGAGCCTTTTAGAAAACAAAATCCAGACATAGTCATCTATCAATACAT 
3101 GGATGATTTGTATGTAGGATCTGACTTAGAAATAGGGCAGCATAGAACAA 
3151 AAATAGAGGAACTGAGACAACATCTGTTGAGGTGGGGATTTACCACACCA 
3201 GACAAAAAACATCA.GAAAGAACCTCCATTCCTTTGGATGGGTTATGAACT 
3251 CCATCCTGATAAATGGACAGTACAGCCTATAGTGCTGCCAGAAAAGGACA 
3301 GCTGGACTGTCAATGACATACAGAAATTAGTGGGAAAATTGAATTGGGCA 
3351 AGTCAGATTTATGCAGGGATTAAAGTAAGGCAATTATGTAAACTTCTTAG 
3401 GGG AACCAAAGCACTAAGAGAAGTAGTAC 

3451 TAGAACTGGCAGAAAACAGGGAGATTCTAAAAGAACCGGTACATGGAGTG 
3501 TATTATGACCCATCAAAAGACTTAATAGCAGAAATACAGAAGCAGGGGCA 
3551 AGGCCAATGGACATATCAAATTTATCAAGAGCCATTTAAAAATCTGAAAA 
3601 CAGGAAAATATGCAAGAATGAAGGGTGCCCACACTAATGATGTGAAACAA 
3651 TTAACAGAGGCAGTACAAAAAATAGCCACAGAAAGCATAGTAATATGGGG 
3701 AAAGACTCCTAAATTTAAATTACCCATACAAAAGGAAACATGGGAAGCAT 
3751 GGTGGACAGAGTATTGGCAAGCCACCTGGATTCCTGAGTGGGAGTTTGTC 
3801 AATACCCCTCCCTTAGTGAAGTTATGGTACCAGTTAGAGAAAGAACCCAT 
3851 AATAGGAGCAGAAACTTTCTATGTAGATGGGGCAGCCAATAGGGAAACTA 
3901 AATTAGGAAAAGCAGGATATGTAACTGACAGAGGAAGACAAAAAGTTGTC 
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3951 CCCCTAACGGACACAACAAATCAGAAGACTGAGTTACAAGCAATTCATCT 
4001 AGCTTTGCAGGATTCGGGATTAGAAGTAAACATAGTGACAGACTCACAAT 
4051 ATGCATTGGGAATCATTCAAGCACAACCAGATAAGAGTGAATCAGAGTTA 
4101 GTCAGTCAAATAATAGAGCAGTTAATAAAAAAGGAAAAAGTCTACCTGGC 
4151 ATGGGTACCAGCACACAAAGGAATTGGAGGAAATGAACAAGTAGATGGGT 
4201 TGGTCAGTGCTGGAATCAGGAAAGTACTATTTTTAGATGGAATAGATAAG 
4251 GCCCAAGAAGAACATGAGAAATATCACAGTAATTGGAGAGCAATGGCTAG 
4301 TGATTTTAACCTACCACCTGTAGTAGCAAAAGAAATAGTAGCCAGCTGTG 
4351 ATAAATGTCAGCTAAAAGGGGAAGCCATGCATGGACAAGTAGACTGTAGC 
4401 CCAGGAATATGGCAGCTAGATTGTACACATTTAGAAGGAAAAGTTATCTT 
4451 GGTAGCAGTTCATGTAGCCAGTGGATATATAGAAGCAGAAGTAATTCCAG 
4501 CAGAGACAGGGCAAGAAACAGCATACTTCCTCTTAAAATTAGCAGGAAGA 
4551 TGGCCAGTAAAAACAGTACATACAGACAATGGCAGCAATTTCACCAGTAC 
4601 TACAGTTAAGGCCGCCTGTTGGTGGGCGGGGATCAAGCAGGAATTTGGCA 
4651 TTCCCTACAATCCCCAAAGTCAAGGAGTAATAGAATCTATGAATAAAGAA 
4701 TTAAAGAAAATTATAGGACAGGTAAGAGATCAGGCTGAACATCTTAAGAC 
4751 AGCAGTACAAATGGCAGTATTCATCCACAATTTTAAAAGAAAAGGGGGGA 
4801 TTGGGGGGTACAGTGCAGGGGAAAGAATAGTAGACATAATAGCAACAGAC 
4851 ATACAAACTAAAGAATTACAAAAACAAATTACAAAAAT^ 
4901 GGTTTATTACAGGGACAGCAGAGATCCAGTTTGGAAAGGACCAGCAAAGC 
4951 TCCTCTGGAAAGGTGAAGGGGCAGTAGTAATACAAGATAATAGTGACATA 
5001 AAAGTAGTGCCAAGAAGAAAAGCAAAGATCATCAGGGATTATGGAAAACA 
5051 GATGGCAGGTGATGATTGTGTGGCAAGTAGACAGGATGAGGATTAACACA 
5101 TGGAAAAGATTAGTAAAACACCATATGTATATTTCAAGGAAAGCTAAGGA 
5151 CTGGTTTTATAGACATCACTATGAAAGTACTAATCCAAAAATAAGTTCAG 
5201 AAGTACACATCCCACTAGGGGATGCTAAATTAGTAATAACAACATATTGG 
5251 GGTCTGCATACAGGAGAAAGAGACTGGCATTTGGGTCAGGGAGTCTCCAT 
5301 AGAATGGAGGAAAAAGAGATATAGCACACAAGTAGACCCTGACCTAGCAG 
5351 ACCAACTAATTCATCTGCACTATTTTGATTGTTTTTCAGAATCTGCTATA 
5401 AGAAATACCATATTAGGACGTATAGTTAGTCCTAGGTGTGAATATCAAGC 
5451 AGGACATAACAAGGTAGGATCTCTACAGTACTTGGCACTAGCAGCATTAA 
5501 TAAAACCAAAACAGATAAAGCCACCTTTGCCTAGTGTTAGGAAACTGACA 
5551 GAGGACAGATGGAACAAGCCCCAGAAGACCAAGGGCCACAGAGGGAGCCA 
5601 TACAATGAATGGACACTAGAGCTTTTAGAGGAACTTAAGAGTGAAGCTGT 
5651 TAGAC^TTTTCCTAGGATATGGCTCCATAACTTAGGACAACATATCTATG 
5701 AAACTTACGGGGATACTTGGGCAGGAGTGGAAGCCATAATAAGAATTCTG 
5751 CAACAACTGCTGTTTATCCATTTCAGAATTGGGTGTCGACATAGCAGAAT 
5801 AGGCGTTACTCGACAGAGGAGAGCAAGAAATGGAGCCAGTAGATCCTAGA 
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5851 CTAGAGCCCTGGAAGCATCCAGGAAGTCAGCCTAAAACTGCTTGt ACCAA 
5901 TTGCTATTGTAAAAAGTGTTGCTTTCATTGCCAAGTTTGTTTCATGACAA 
5951 AAGCCTTAGGCATCTCCTATGGCAGGAAGAAGCGGAGACAGCGACGAAGA 
6001 GCTCATCAGAACAGTCAGACTCATCAAGCTTCTCTATCAAAGCAGTAAGT 
6051 AGTACATGTAATGCAACCTATAATAGTAGCAATAGTAGCATTAGTAGTAG 
6101 CAATAATAATAGCAATAGTTGTGTGGTCCATAGTAATCATAGAATATAGG 
6151 AAAATATTAAGACAAAGAAAAATAGACAGGTTAATTGATAGACTAATAGA 
6201 AAGAGCAGAAGACAGTGGCAATGAGAGTGAAGGAGAAGTATCAGCACTTG 
6251 TGGAGATGGGGGTGGAAATGGGGCACCATGCTCCTTGGGATATTGATGAT 
6301 CTGTAGTGCTACAGAAAAATTGTGGGTCACAGTCTATTATGGGGTACCTG 
6351 TGTGGAAGGAAGCAACCACCACTCrrATTTTGTGCATCAGATGCTAAAGCA 
6401 TATGATACAGAGGTACATAATGTTTGGGCCACACATGCCTGTGTACCCAC 
6451 AGACCCCAACCCACAAGAAGTAGTATTGGTAAATGTGACAGAAAATTTTA 
6501 ACATGTGGAAAAATGACATGGTAGAACAGATGCATGAGGATATAATCAGT 
6551 TTATGGGATCAAAGCCTAAAGCCATGTGTAAAATTAACCCCACTCTGTGT 
6601 TAGTTTAAAGTGCACTGATTTGAAGAATGATACTAATACCAATAGTAGTA 
6651 GCGGGAGAATGATAATGGAGAAAGGAGAGATAAAAAACTGCTCTTTCAAT 
6701 ATCAGCACAAGCATAAGAGATAAGGTGCAGAAAGAATATGCATTCTTTTA 
6751 TAAACTTGATATAGTACCAATAGATAATACCAGCTATAGGTTGATAAGTT 
6801 GTAACACCTCAGTCATTACACAGGCCTGTCCAAAGGTATCCTTTGAGCCA 
6851 ATTCCC7VTACATTATTGTGCCGCGGCTGGTTTTGCGATTCTAAAATGTAA 
6901 TAATAAGACGTTCAATGGAACAGGACCATGTACAAATGTCAGCACAGTAC 
6951 AATGTACACATGGAATCAGGCC^GTAGTATCAACTCAACTGCTGTTAAAT 
7001 GGCAGTCTAGCAGAAGAAGATGTAGTAATTAGATCTGCCAATTTCACAGA 
7051 CAATGCTAAAACCATAATAGTACAGCTGAACACATCTGTAGAAATTAATT 
7101 GTACAAGACCCAAC^^CAATACAAGAAAAAGTATCCGTATCCAGAGGGGA 
7151 CCAGGGAGAGCATTTGTTACAATAGGAAT^AATAGGAAATATGAGACAAGC 
7201 ACATTGTAACATTAGTAGAGCAAAATGGAATGCCACTTTAAAACAGATAG 
7251 CTAGCAAATTAAGAGAACAATTTGGAAATAATAAAACAATAATCTTTAAG 
7301 CAATCCTCAGGAGGGGACCCAGAAATTGTAACGCACAGTTTTAATTGTGG 
7351 AGGGGAATTTTTCTACTGTAATTCAACACAACTGTTTAATAGTACTTGGT 
7401 TTAATAGTACTTGGAGTACTGAAGGGTCAAATAACACTGAAGGAAGTGAC 
7451 ACAATCACACTCCCATGCAGAATAAAACAATTTATAAACATGTGGCAGGA 
7501 AGTAGGAAAAGCAATGTATGCCCCTCCCATCAGTGGACAAATTAGATGTT 
7551 CATCAAATATTACTGGGCTGCTATTAACAAGAGATGGTGGTAATAACAAC 
7601 AATGGGTCCGAGATCTTCAGACCTGGAGGAGGCGATATGAGGGACAATTG 
7651 GAGAAGTGAATTATATAAATATAAAGTAGTAAAAATTGAACCATTAGGAG 
7701 TAGCACCCACCAAGGCAAAGAGAAGAGTGGTGCAGAGAGAAAAAAGAGCA 
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7751 GTGGGAATAGGAGCTTTGTTCCTTGGGTTCTTGGGAGCAGCAGGAAGCAC 
7 8 01 TATGGGCTGCACGTCAATGACGCTGACGGTACAGGCCAGACAATTATTGT 
7851 CTGATATAGTGCAGCAGCAGAACAATTTGCTGAGGGCTATTGAGGCGCAA 
7901 CAGCATCTGTTGCAACTCACAGTCTGGGGCATCAAACAGCTCCAGGCAAG 
5 7951 AATCCTGGCTGTGGAAAGATACCTAAAGGATCAACAGCTCCTGGGGATTT 

8001 GGGGTTGCTCTGGAAAACTCATTTGCACCACTGCTGTGCCTTGGAATGCT 
8051 AGTTGGAGTAATAAATCTCTGGAACAGATTTGGAATAACATGACCTGGAT 
8101 GGAGTGGGACAGAGAAATTAACAATTACACAAGCTTAATACACTCCTTAA 
8151 TTGAAGAATCGCAAAACCAGCAAGAAAAGAATGAACAAGAATTATTGGAA 

10 8201 TTAGATAAATGGGCAAGTTTGTGGAATTGGTTTAACATAACAAATTGGCT 

8251 GTGGTATATAAAATTATTCATAATGATAGTAGGAGGCTTGGTAGGTTTAA 
8301 GAATAGTTTTTGCTGTACTTTCTATAGTGAATAGAGTTAGGCAGGGATAT 
8351 TCACCATTATCGTTTCAGACCCACCTCCCAATCCCGAGGGGACCCGACAG 
8401 GCCCGAAGGAATAGAAGAAGAAGGTGGAGAGAGAGACAGAGACAGATCCA 

15 8451 TTCGATTAGTGAACGGATCCTTAGCACTTATCTGGGACGATCTGCGGAGC 

8501 CTGTGCCTCTTCAGCTACCACCGCTTGAGAGACTTACTCTTGATTGTAAC 
8551 GAGGATTGTGGAACTTCTGGGACGCAGGGGGTGGGAAGCCCTCAAATATT 
8601 GGTGGAATCTCCTACAGTATTGGAGTCAGGAACTAAAGAATAGTGCTGTT 
8651 AACTTGCTCAATGCCACAGCCATAGCAGTAGCTGAGGGGACAGATAGGGT 

20 8701 TATAGAAGTATTACAAGCAGCTTATAGAGCTATTCGCCACATACCTAGAA 

8751 GAATAAGACAG^CTTGGAAAGGATTTTGCTATAAGATGGGTGGCAAGTG 
8801 GTCAAAAAGTAGTGTGATTGGATGGCCTGCTGTAAGGGAAAGAATGAGAC 
8851 GAGCTGAGCCAGCAGCAGATGGGGTGGGAGCAGTATCTCGAGACCTAGAA 
8901 AAACATGGAGCAATCACAAGTAGCAATACAGCAGCTAACAATGCTGCTTG 

25 8951 TGCCTGGCTAGAAGCACAAGAGGAGGAAGAGGTGGGTTTTCCAGTCACAC 

9001 CTCAGGTACCTTTAAGACCAATGACTTACAAGGCAGCTGTAGATCTTAGC 
9051 CACTTTTTAAAAGAAAAGGGGGGACTGGAAGGGCTAATT 
9101 AAGACAAGATATCCTTGATCTGTGGATCTACCACACACAAGGCTACTTCC 
9151 CTGATTGGCAGAACTACACACCAGGGCCAGGGGTCAGATATCCACTGACC 

30 9201 TTTGGATGGTGCTACAAGCTAGTACCAGTTGAGCCAGATAAGGTAGAAGA 

9251 GGCCAATAAAGGAGAGAAC^CCAGCTTGTTACACCCTGTGAGCCTGCATG 
9301 GAATGGATGACCCTGAGAGAGAAGTGTTAGAGTGGAGGTTTGACAGCCGC 
9351 CTAGCATTTCATCACGTGGCCCGAGAGCTGCATCCGGAGTACTTCAAGAA 
9401 CTGCTGACATCGAGCTTGCTACAAGGGACTTTCCGCTGGGGACTTTCCAG 

35 9451 GGAGGCGTGGCCTGGGCGGGACTGGGGAGTGGCGAGCCCTCAGATGCTGC 

9501 ATATAAGCAGCTGCTTTTTGCCTGTACTGGGTCTCTCTGGTTAGACCAGA 
9551 TCTGAGCCTGGGAGCTCTCTGGCTAACTAGGGAACCCACTGCTTAAGCCT 
9601 CAATAAAGCTTGCCTTGAGTGCTTCAAGTAGTGTGTGCCCGTCTGTTGTG 
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9651 TGACTCTGGTAACTAGAGATCCCTCAGACCCTTTTAGTCAGTGTGGAi\AA 
9701 TCTCTAGCA 

However, for the purposes of comparing the nucleotide sequences of non-pathogenic 
5 HIV-1 strains including the ability to hybridise to a reference strain, the present 
invention extends to a genomic nucleotide sequence from any pathogenic strain of HIV- 
1. 

Accordingly, in a particularly preferred embodiment, there is provided a viral isolate 
10 which: 

(i) carries a genome which is capable of hybridising under medium stringency 
conditions to SEQ ID NO: 1 or a complementary form thereof or an analogous 
sequence from another pathogenic strain of HTV-1; and 

(ii) carries a deletion mutation in a region corresponding to the rief gene and/or in 
15 an LTR region. Generally, such an HTV-1 isolate is non-pathogenic as 

hereinbefore defined. 

In a related embodiment, there is provided an isolated virus which: 

(i) has a genome which is capable of hybridising under medium stringency 
20 conditions to complementary nucleic acid from a pathogenic strain of HIV-1; 

and 

(ii) carries one or more deletion mutations in a region of its genome corresponding 
to a region which contains nef coding sequences and LTR nucleotide sequences. 

25 For the purposes of defining the level of stringency, reference can conveniently be made 
to Maniatis et al (1982) at pages 387-389 which is herein incorporated by reference 
where the washing steps disclosed are considered high stringency. A low stringency is 
defined herein as being in 1-3X SSC/0.1-0.5% w/v SDS at 37-50°C for 2-3 hours. 
Depending on the source and concentration of nucleic acid involved in the hybridisation, 

30 alternative conditions of stringency may be employed such as medium stringent 
conditions which are considered herein to be 0.1-1X SSC/0.25-0.5% w/v SDS at £ 45°C 
for 2-3 hours or high stringent conditions considered herein to be 0.1-1X SSC/0.1% w/v 
SDS at 60°C for 1-3 hours. 
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In a particularly preferred embodiment of the present invention, the non-pathogenic 
strain of HIV- 1 carries a mutation in the nef gene and/or LTR region of the genome. 



A "mutation" is considered herein to include a single or multiple nucleotide substitution, 
5 deletion and/or addition. Most preferred mutations are single or multiple deletions of 
at least one, most preferably at least ten and even more preferably at least twenty 
contiguous nucleotides from a region corresponding to the nef gene and/or the LTR 
region. When the non-pathogenic virus carries a mutation in the LTR region, this 
generally occurs 5' of the Spl sites. 

10 

According to a preferred aspect of the present invention, there is provided a viral isolate 
which: 

(i) is reactive to antibodies to a glycoprotein from HIV-1 such as at least one of 
gp41-45, gpl20 and/or gpl60; 

15 (ii) is substantially non-pathogenic in human subjects; and 

(iii) carries a deletion mutation of at least ten nucleotides in a region corresponding 
to the ne/gene and/or LTR region of a pathogenic strain of HIV-1 . 

In another embodiment, there is provided a viral isolate which: 
20 (i) is capable of inducing an immune response to at least one of gag pol and/or em; 

(ii) is substantially non-pathogenic in human subjects; and 

(iii) carries a deletion mutation of at least ten nucleotides in a region corresponding 
to the «e/gene and/or LTR region of a pathogenic strain of HIV-1. 

25 Preferably, in respect of the latter embodiment, the immune response is an antibody or 
a cell mediated response. In a most preferred embodiment, the immune response is a 
humoral immune response. 

The nucleotide sequence of the nef gene in HIV-1 NL4-3 is defined in SEQ ID NO: 
30 650: 

ATGGGTGGCAAGTGGTCAAAAAGTAGTGTGATTGGATGGCCTGCTGTAAGGGAAAGAAT 
GAGAQGAGCTGAGCCAGC^GCAGATGGGGTGGGAGC^GTATCTCGAGACCTAGAAAAAC 
ATGGAGCAATCACAAGTAGCAATACAGCAGCTAACAATGCT^ 
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GCACAAGAGGAGGAAGAGGTGGGTTTTCCAGTCACACCTCAGGTACCTTTAAGACCAAT 
GACTTACAAGGCAGCTGTAGATCTTAGCCACTTTTTAAAAGAAAAGGGGGGACTGGAAG 
GGCTAATTCACTCCCAAAGAAGACAAGATATCCTTGATCTGTGGATCTACCACACACAA 
GGCTACTTCCCTGATTGGCAGAACTACACACCAGGGCCAGGGGTCAGATATCCACTGAC 
5 CTTTGGATGGTGCTACAAGCTAGTACCAGTTGAGCCAGATAAGGTAGAAGAGGCCAATA 
AAGGAGAGAACACCAGCTTGTTACACCCTGTGAGCCTGCATGGAATGGATGACCCTGAG 
AGAGAAGTGTTAGAGTGGAGGTTTGACAGCCGCCTAGCATTTCATCACGTGGCCCGAGA 
GCTGCATCCGGAGTACTTCAAGAACTGCTGA 

10 The present invention extends to any or all single or multiple nucleotide deletions to a 
contiguous series of at least ten nucleotides from the nef gene which render the strain 
avirulent. The deletions may encompass the entire gene or parts thereof and may 
represent a single deletion or two or more deletions. Put in alternative terms, the non- 
pathogenic HTV-1 isolates of the present invention comprise a nucleotide sequence at the 

IS corresponding nef gene region non-identifiable to SEQ ID NO: 650, said non-identity 
comprising at least 5%, more preferably at least 10% and even more preferably at least 
20% variation thereon. 

In a preferred embodiment, therefore, the present invention contemplates a viral isolate 
20 which: 

(i) is reactive to antibodies to a glycoprotein from HTV-1 such as at least one of 
gp41-45, gpl20 and/or gpl60; 

(ii) carries a genome or a part or fragment thereof capable of hybridising under 
medium stringency conditions to a nucleotide sequence as set forth in SEQ ID 

25 NO: 1 or a complementary form thereof or an analogous sequence from another 

pathogenic strain of HTV-1; 

(iii) carries a deletion of at least ten nucleotides in a region corresponding to the nef 
gene in HTV-1 NL4-3; and 

wherein said deletion encompasses one or more of the following decanucleotides from 
30 the nef gene of HIV- 1 NL4-3 or corresponding sequences from another pathogenic strain 
ofHIV-1: 

ATGGGTGGCA ( SEQ ID NO: 2); TGGGTGGCAA ( SEQ ID NO: 3) ; 
GGGTGGCAAG ( SEQ ID NO: 4); GGTGGCAAGT ( SEQ ID NO: 5) ; 
GTGGCAAGTG (SEQ ID NO: 6); TGGCAAGTGG ( SEQ ID NO: 7); 
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GGCAAGTGGT (SEQ ID NO: 8) ; 
CAAGTGGTCA (SEQ ID NO: 10) 
AGTGGTCAAA (SEQ ID NO: 12) 
TGGTCAAAAA (SEQ ID NO: 14) 
5 GTCAAAAAGT (SEQ ID NO: 16) 
CAAAAAGTAG (SEQ ID NO: 18) 
AAAAGTAGTG ( SEQ ID NO: 20) 
AAGTAGTGTG (SEQ ID NO: 22) 
GTAGTGTGAT ( SEQ ID NO: 24) 
10 AGTGTGATTG (SEQ ID NO: 26) 
TGTGATTGGA { SEQ ID NO: 28) 
TGATTGGATG ( SEQ ID NO: 30) 
ATTGGATGGC (SEQ ID NO: 32) 
TGGATGGCCT (SEQ ID NO: 34) 
15 GATGGCCTGC (SEQ ID NO: 36) 
TGGCCTGCTG (SEQ ID NO: 38) 
GCCTGCTGTA (SEQ ID NO: 40) 
CTGCTGTAAG (SEQ ID NO: 42) 
GCTGTAAGGG (SEQ ID NO: 44) 
20 TGTAAGGGAA ( SEQ ID NO: 46) 
TAAGGGAAAG (SEQ ID NO: 48), 
AGGGAAAGAA (SEQ ID NO: 50) , 
GGAAAGAATG (SEQ ID NO: 52), 
AAAGAATGAG (SEQ ID NO: 54) ; 
25 AGAATGAGAC (SEQ ID NO: 56) ; 
AATGAGACGA (SEQ ID NO: 58); 
TGAGACGAGC ( SEQ ID NO: 60); 
AGACGAGCTG (SEQ ID NO: 62); 
ACGAGCTGAG (SEQ ID NO: 64); 
30 GAGCTGAGCC (SEQ ID NO: 66); 
GCTGAGCCAG (SEQ ID NO: 68); 
TGAGCCAGCA (SEQ ID NO: 70); 
AGCCAGCAGC (SEQ ID NO: 72); 
CCAGCAGCAG ( SEQ ID NO: 74); 
35 AGCAGCAGAT (SEQ ID NO: 76); 
CAGCAGATGG (SEQ ID NO: 78); 
GCAGATGGGG ( SEQ ID NO: 80); 
AGATGGGGTG (SEQ ID NO: 82); 
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ATGGGGTGGG (SEQ ID NO: 84) j 
GGGGTGGGAG (SEQ . ID NO: 86) j 
GGTGGGAGCA (SEQ ID NO: 88) ; 
TGGGAGCAGT (SEQ ID NO: 90); 
5 GGAGCAGTAT (SEQ ID NO: 92); 
AGCAGTATCT (SEQ ID NO: 94); 
CAGTATCTCG (SEQ ID NO: 96); 
GTATCTCGAG (SEQ ID NO: 98); 
ATCTCGAGAC (SEQ ID NO: 100) 
10 CTCGAGACCT (SEQ ID NO: 102) 
CGAGACCTAG ( SEQ ID NO: 104) 
AGACCTAGAA (SEQ ID NO: 106) 
ACCTAGAAAA ( SEQ ID NO: 108) 
CTAGAAAAAC (SEQ ID NO: 110) 

.15 AGAAAAACAT ( SEQ ID NO: 112) 
AAAAACATGG ( SEQ ID NO: 114) 
AAACATGGAG ( SEQ ID NO: 116) 
ACATGGAGCA (SEQ ID NO: 118) 
ATGGAGCAAT ( SEQ ID NO: 120) 

20 GGAGCAATCA (SEQ ID NO: 122) 
AGCAATCACA (SEQ ID NO: 124) 
CAATCACAAG (SEQ ID NO: 126) 
ATCACAAGTA ( SEQ ID NO: 128) 
CACAAGTAGC (SEQ ID NO: 130) 

25 CAAGTAGCAA (SEQ ID NO: 132) 
AGTAGCAATA (SEQ ID NO: 134) 
TAGCAATACA (SEQ ID NO: 136) 
GCAATACAGC (SEQ ID NO: 138) 
AATACAGCAG (SEQ ID NO: 140) 

30 TACAGCAGCT (SEQ ID NO: 142) 
CAGCAGCTAA ( SEQ ID NO: 144) 
GCAGCTAACA ( SEQ ID NO: 146) 
AGCTAACAAT (SEQ ID NO: 148) 
CTAACAATGC (SEQ ID NO: 150) 

35 AACAATGCTG (SEQ ID NO: 152) 
CAATGCTGCT (SEQ ID NO: 154) 
ATGCTGCTTG (SEQ ID NO: 156) 
GCTGCTTGTG (SEQ ID NO: 158) 
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TGCTTGTGCC (SEQ ID NO: 160 
CTTGTGCCTG (SEQ ID NO: 162 
TGTGCCTGGC (SEQ ID NO: 164 
TGCCTGGCTA (SEQ ID NO: 166 
5 CCTGGCTAGA (SEQ ID NO: 168 
TGGCTAGAAG (SEQ ID NO: 170 
GCTAGAAGCA ( SEQ ID NO: 172! 
TAGAAGCACA ( SEQ ID NO: 174! 
GAAGCACAAG (SEQ ID NO: 176! 
10 AGCACAAGAG (SEQ ID NO: 178] 
CACAAGAGGA ( SEQ ID NO: 180] 
CAAGAGGAGG (SEQ ID NO: 182] 
AGAGGAGGAA (SEQ ID NO: 184] 
AGGAGGAAGA (SEQ ID NO: 186] 

15 GAGGAAGAGG (SEQ ID NO: 188) 
GGAAGAGGTG (SEQ ID NO: 190) 
AAGAGGTGGG ( SEQ ID NO: 192) 
GAGGTGGGTT (SEQ ID NO : 194 ) 
GGTGGGTTTT (SEQ ID NO : 196 ) 

20 TGGGTTTTCC (SEQ ID NO: 198) 
GGTTTTCCAG ( SEQ ID NO : 200) 
TTTTCCAGTC (SEQ ID NO: 202) 
TTCCAGTCAC (SEQ ID NO: 204) 
CCAGTCACAC(SEQ ID NO: 206) 

25 AGTCACACCT (SEQ ID NO: 208) 
TCACACCTCA(SEQ ID NO: 210) 
ACACCTCAGG (SEQ ID NO: 212) 
ACCTCAGGTA (SEQ ID NO: 214) 
CTCAGGTACC (SEQ ID NO: 216) 

30 CAGGTACCTT { SEQ ID NO: 218) 
GGTACCTTTA ( SEQ ID NO: 220) 
TACCTTTAAG (SEQ ID NO: 222) 
CCTTTAAGAC (SEQ ID NO: 224) 
TTTAAGACCA (SEQ ID NO: 226) 

35 TAAGACCAAT (SEQ ID NO: 228) 
AGACCAATGA ( SEQ ID NO: 230) 
ACCAATGACT (SEQ ID NO: 232) 
CAATGACTTA (SEQ ID NO: 234) 
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ATGACTTACA (SEQ ID NO: 236 
GACTTACAAG (SEQ ID NO: 238 
CTTACAAGGC (SEQ ID NO: 240 
TACAAGGCAG (SEQ ID NO: 242 
5 CAAGGCAGCT (SEQ ID NO: 244 
AGGCAGCTGT (SEQ ID NO: 246 
GCAGCTGTAG (SEQ ID NO: 248! 
AGCTGTAGAT (SEQ ID NO: 250! 
CTGTAGATCT (SEQ ID NO: 252! 
10 GTAGATCTTA (SEQ ID NO: 254! 
AGATCTTAGC (SEQ ID NO: 256! 
ATCTTAGCCA (SEQ ID NO: 258] 
CTTAGCCACT (SEQ ID NO: 260] 
TAGCCACTTT (SEQ ID NO: 262] 

15 GCCACTTTTT (SEQ ID NO: 264] 
CACTTTTTAA (SEQ ID NO: 266] 
CTTTTTAAAA ( SEQ ID NO: 268] 
TTTTAAAAGA(SEQ ID NO: 270] 
TTAAAAGAAA ( SEQ ID NO: 272] 

20 AAAAGAAAAG ( SEQ ID NO: 274] 
AAGAAAAGGG ( SEQ ID NO: 276) 
GAAAAGGGGG ( SEQ ID NO: 278) 
AAAGGGGGGA ( SEQ ID NO: 280) 
AGGGGGGACT (SEQ ID NO: 282) 

25 GGGGGACTGG ( SEQ ID NO: 284) 
GGGACTGGAA ( SEQ ID NO: 286) 
GACTGGAAGG ( SEQ ID NO: 288) 
CTGGAAGGGC (SEQ ID NO: 290) 
GGAAGGGCTA ( SEQ ID NO: 292) 

30 AAGGGCTAAT (SEQ ID NO: 294) 
GGGCTAATTC (SEQ ID NO: 296) 
GCTAATTCAC (SEQ ID NO: 298) 
TAATTCACTC (SEQ ID NO: 300) 
ATTCACTCCC(SEQ ID NO: 302) 

35 TCACTCCCAA(SEQ ID NO: 304) 
ACTCCCAAAG (SEQ ID NO: 306) 
TCCCAAAGAA (SEQ ID NO: 308) 
CCAAAGAAGA (SEQ ID NO: 310) 
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AAAGAAGACA (SEQ ID NO: 312) 
AGAAGACAAG (SEQ ID NO: 314) 
AAGACAAGAT (SEQ ID NO: 316) 
GACAAGATAT (SEQ ID NO: 318) 
5 CAAGATATCC(SEQ ID NO: 320) 
AGATATCCTT (SEQ ID NO: 322) 
ATATCCTTGA (SEQ ID NO: 324) 
ATCCTTGATC (SEQ ID NO: 326) 
CCTTGATCTG (SEQ ID NO: 328) 
10 TTGATCTGTG (SEQ ID NO: 330) 
GATCTGTGGA ( SEQ ID NO: 332) 
TCTGTGGATC (SEQ ID NO: 334) 
TGTGGATCTA ( SEQ ID NO: 336) 
TGGATCTACC (SEQ ID NO: 338) 
15 GATCTACCAC (SEQ ID NO : 34 0 ) 
TCTACCACAC(SEQ ID NO: 342), 
TACCACACAC(SEQ ID NO: 344) 
CCACACACAA(SEQ ID NO: 346) , 
ACACACAAGG (SEQ ID NO: 348) ; 
20 ACACAAGGCT (SEQ ID NO : 350) ; 

ACAAGGCTAC (SEQ ID NO: 352) ; 

AAGGCTACTT (SEQ ID NO: 354) ; 

GGCTACTTCC (SEQ ID NO: 356); 

CTACTTCCCT(SEQ ID NO: 358); 
25 ACTTCCCTGA (SEQ ID NO: 360); 

TTCCCTGATT (SEQ ID NO: 362); 

CCCTGATTGG (SEQ ID NO: 364); 

CTGATTGGCA ( SEQ ID NO: 366); 

GATTGGCAGA ( SEQ ID NO : 368); 
30 TTGGCAGAAC (SEQ ID NO: 370); 

GGCAGAACTA ( SEQ ID NO: 372); 

CAGAACTACA ( SEQ ID NO: 374); 

GAACTACACA (SEQ ID NO: 376); 

ACTACACACC ( SEQ ID NO : 378); 
35 TACACACCAG (SEQ ID NO: 380); 

CACACCAGGG (SEQ ID NO: 382); 

CACCAGGGCC(SEQ ID NO: 384); 

CCAGGGCCAG (SEQ ID NO: 386); 
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r AAGAAGACAA ( SEQ ID NO: 313); 
r GAAGACAAGA ( SEQ ID NO: 315); 

AGACAAGATA ( SEQ ID NO: 317); 

ACAAGATATC ( SEQ ID NO: 319); 

AAGATATCCT (SEQ ID NO: 321); 

GATATCCTTG (SEQ ID NO: 323); 

TATCCTTGAT ( SEQ ID NO: 325); 

TCCTTGATCT (SEQ ID NO: 327); 

CTTGATCTGT (SEQ ID NO: 329); 

TGATCTGTGG (SEQ ID NO: 331); 

ATCTGTGGAT (SEQ ID NO: 333); 

CTGTGGATCT ( SEQ ID NO : 335); 

GTGGATCTAC (SEQ ID NO: 337); 

GGATCTACCA (SEQ ID NO: 339); 
ATCTACCACA (SEQ ID NO: 341); 

CTACCACACA(SEQ ID NO: 343); 
ACCACACACA(SEQ ID NO: 345) ; 
CACACACAAG (SEQ ID NO: 347); 
CACACAAGGC (SEQ ID NO: 349); 
GACAAGGCTA ( SEQ ID NO: 351) ; 
CAAGGCTACT (SEQ ID NO: 353); 
AGGCTACTTC (SEQ ID NO: 355); 
GCTACTTCCC(SEQ ID NO: 357); 
TACTTCCCTG ( SEQ ID NO: 359); 
CTTCCCTGAT (SEQ ID NO: 361); 
TCCCTGATTG (SEQ ID NO: 363); 
CCTGATTGGC (SEQ ID NO: 365); 
TGATTGGCAG ( SEQ ID NO : 367); 
ATTGGCAGAA (SEQ ID NO: 369); 
TGGCAGAACT (SEQ ID NO: 371); 
GCAGAACTAC (SEQ ID NO: 373); 
AGAACTACAC (SEQ ID NO: 375); 
AACTACACAC (SEQ ID NO: 377); 
CTACACACCA(SEQ ID NO: 379); 
ACACACCAGG (SEQ ID NO: 381); 
ACACCAGGGC (SEQ ID NO: 383); 
ACCAGGGCCA (SEQ ID NO: 385); 
CAGGGCCAGG (SEQ ID NO: 387); 
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AGGGCCAGGG (SEQ ID NO: 388 
GGCCAGGGGT (SEQ ID NO: 390 
CCAGGGGTCA ( SEQ ID NO: 392 
AGGGGTCAGA ( SEQ ID NO: 394 
5 GGGTCAGATA ( SEQ ID NO: 396 
GTCAGATATC ( SEQ ID NO: 398 
CAGATATCCA ( SEQ ID NO: 400 
GATATCCACT (SEQ ID NO: 402 
TATCCACTGA (SEQ ID NO: 404 
10 TCCACTGACC(SEQ ID NO: 406! 
CACTGACCTT (SEQ ID NO: 408! 
CTGACCTTTG (SEQ ID NO: 410] 
GACCTTTGGA { SEQ ID NO: 412] 
CCTTTGGATG (SEQ ID NO: 414] 

15 TTTGGATGGT (SEQ ID NO: 416] 
TGGATGGTGC (SEQ ID NO: 418] 
GATGGTGCTA ( SEQ ID NO: 420] 
TGGTGCTACA ( SEQ ID NO: 422] 
GTGCTACAAG (SEQ ID NO: 424] 

20 GCTACAAGCT (SEQ ID NO: 426] 
TACAAGCTAG (SEQ ID NO: 428] 
CAAGCTAGTA (SEQ ID NO: 430] 
AGCTAGTACC (SEQ ID NO: 432] 
CTAGTACCAG (SEQ ID NO: 434) 

25 AGTACCAGTT (SEQ ID NO: 436) 
TACCAGTTGA (SEQ ID NO: 438) 
CCAGTTGAGC (SEQ ID NO: 440) 
AGTTGAGCGA (SEQ ID NO: 442) 
TTGAGCCAGA(SEQ< ID NO: 444) 

30 GAGCCAGATA (SEQ ID NO: 446) 
GCCAGATAAG (SEQ ID NO: 448) 
CAGATAAGGT (SEQ ID NO: 450) 
GATAAGGTAG (SEQ ID NO: 452) 
TAAGGTAGAA ( SEQ ID NO: 454) 

35 AGGTAGAAGA (SEQ ID NO: 456) 
GTAGAAGAGG (SEQ ID NO: 458 ) 
AGAAGAGGCC ( SEQ ID NO: 460) 
AAGAGGCCAA (SEQ ID NO: 462) 
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ATTT CATPA P ( SEO 

X X X WAX W1U \ wX3\V, 


TD 

XXV 


NO • 






TTTCATCACG (SEQ 


ID 


NO: 


572) ; 


TTCATCACGT (SEO 

x xwxxwnwwx 


ID 

XXS 


NO* 


573 ) 




TCATCACGTG (SEO 


ID 


NO: 


574) ; 


CATCACGTGG (SEO 


ID 

XI/ 


NO* 

xnw • 


575) 




ATCACGTGGC (SEQ 


ID 


NO: 


576) : 


TCACGTGGCC f SEO 


in 

XXV 


NO* 


577) 


20 


CACGTGGCCC ( SEQ 


ID 


NO: 


578) ; 


ACGTGGCCCG (SEO 


ID 


NO: 


579) 




CGTGGCCCGA (SEQ 


ID 


NO: 


580) ; 


GTGGCCCGAG (SEO 


ID 


NO: 


581) 

m* O JL f 




TGGCCCGAGA ( SEQ 


ID 


NO: 


582) ; 


GGCCCGAGAG (SEO 


ID 


NO: 


583) 




GCCCGAGAGC (SEQ 


ID 


NO: 


584) ; 


CCCGAGAGCT (SEO 


ID 


NO: 


585) 

mJ W wJ J 




CCGAGAGCTG (SEQ 


ID 


NO: 


586) ; 


CGAGAGCTGC (SEO 


ID 


NO: 


587) 


25 


GAGAGCTGCA ( SEQ 


ID 


NO: 


588) ; 

*■* W W / / 


AGAGCTGCAT (SEO 


ID 


NO: 


589) 

•J O J / 




GAGCTGCATC (SEQ 


ID 


NO: 


590) ; 


AGCTGCATCC (SEO 


ID 


NO: 


591) 




GCTGCATCCG (SEQ 


ID 


NO: 


592) ; 


CTGCATCCGG (SEQ 


ID 


NO: 


593) 
•j j *j i 




TGCATCCGGA ( SEQ 


ID 


NO: 


594) ; 


GCATCCGGAG (SEQ 


ID 


NO: 


595) i 




CATCCGGAGT (SEQ 


ID 


NO: 


596) ; 


ATCCGGAGTA (SEQ 


ID 


NO: 


597) i 


30 


TCCGGAGTAC (SEQ 


ID 


NO: 


598) ; 


CCGGAGTACT (SEQ 


ID 


NO: 


599) • 




CGGAGTACTT (SEQ 


ID 


NO: 


600) ; 

www/ / 


GGAGTACTTC (SEO 


ID 


NO- 

x«w • 


601) • 

w U X. / j 




GAGTACTTCA (SEQ 


ID 


NO: 


602) ; 


AGTACTTCAA (SEQ 


ID 


NO: 


603) ; 




GTACTTCAAG (SEQ 


ID 


NO: 


604) ; 


TACTTCAAGA (SEQ 


ID 


NO: 


605) j 




ACTTCAAGAA ( SEQ 


ID 


NO: 


606) ; 


CTTCAAGAAC (SEQ 


ID 


NO: 


607) ; 


35 


TTCAAGAACT (SEQ 


ID 


NO: 


608) ; 


TCAAGAACTG (SEQ 


ID 


NO: 


609) ; 




CAAGAACTGC (SEQ 


ID 


NO: 


610) ; 


AAGAACTGCT (SEQ 


ID 


NO: 


611) ; 




AGAACTGCTG (SEQ 


ID 


NO: 


612) ; 


GAACTGCTGA (SEQ 


ID 


NO: 


613) . 
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Generally, the subject HTV-1 isolate is non-pathogenic as hereinbefore defined. 
Additionally, reference herein to "a deletion" includes reference to a contiguous or non- 
contiguous series of two or more deletions. 

5 The non-pathogenic isolate may carry a single decanucleotide deletion or may cany 
more than one decanucleotide deletion. Where it carries multiple deletions these may 
all correspond to a contiguous sequence or be from different parts of the nef gene. 
Furthermore, the terminal end portions of a deletion may lie within a decanucleotide as 
defined above. It is emphasised that the present invention extends to analogous 
10 sequences from other pathogenic strains of HIV-1 which might cany nef genes with a 
slightly altered sequence relative to HIV-1 strain NL4-3. 

In a most preferred embodiment of the present invention, there is provided a non- 
pathogenic strain of HIV-1 carrying a nucleotide sequence in its; genome as set forth 
15 in SEQ ID No. 614: 

GAAGAO&TTTG^AGAACATGACCT 

AAAATACATATACTCCTTACTTGAAAAATCGC 

AACTATTGGAATTGGATG^TGGGCAAGTTTGT 

CTGTGGTATATAAAAATATTCATAATGGTAGTAGGAGGCTTGATAGGTTTAAGAATAGT 
20 TTTTGCTGTACTTTCTATAGTGAATAGAGTTAGGCAGGGATACTCACCATTGTCGTTTC 
AGACCCTCCTCCCAACCCCGAGGGGACCCGACAGGCCCGAAGGAATCGAAGAAGAAGGT 
GGAGAGAGAGACAGAGACAGATCCACTCGATTAGTACACGGATTCTTAGCACtTTTCTG 
GGACGACCTGAGGAGCCTGTGCCTCTTCCTCTACCACCACTTGAGAGACTTACTCTTGA 
TTGTAACAAGGATTGTGGAACTTCTGGGACGCAGGGGATGGGAAGCCCTCAAATATTGG 
25 TGGAACCTCCTAAAGTATTGGAGCCAGGAACTGCZAGAAGAGTGCTGTTATCTTGCTCAA 
TGCCACCGCCATAGCAGTAGCTGAGGGGACAGATAGAGTTTTAGAAGTATTACAAAGAG 
CTTATAGAGCTATCCTCCACATACCTAGAAGAATAAGACAGGGCCTCGAAATGGCTTTG 
CTATAAAATGGGTGGCAAGTGAGCAAAAAGTAGTGTAGTCAGATAGCATGCATCATAAG 
GGGTGGGGGCCAAC^CTAACAATGCTGATCGTGC 

30 AGAAGCGGGTTTTCCAGTCAAACCTCAGGTAGCTGTAGATCTTAGCCACTTTTTAAAAG 
AAAAGGGGGGACTGGAAGGGCTAATTCACTCCCAAAGAAGACAAGATACACAGTGCTGC 
AAACTATTACCAGTGGAGTCAGCGAAGATAGAAGAGGCCAATGGAGGAGAAAACCACAG 
ATTGTTCTGTTGGGGACTTTCCATCCGTTGGGGACTTTCCAAGGCGGCGTGGCCTGGGT 
GACTAGTTCCGGTGGGGACTTTCCAAGAAGGCGCGGCCTGGGCGGGACTGGGGAGTGGC 

35 GAGCCCTCAGATGCTGCATATAAGCAGCTGCTTTCTGCTGTTACTGGGTCTCTCGGGTT 
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AGACCAGATCTGAGCCTGGGAGCTCTCTGGCTAACTAGGGAACCCACTGCTTAAGCCTC 
AATAAAGCTTGCCTTGAGTGCTTCAAGTAGTGTGTGCCCGTCTGTTGTGTGACTCTGGT 
ATCTAGA; 

5 and/or SEQ ID NO: 615: 

GAAACT^TTTGGGATAAC^TGACCTGGATGC^GTGGGAAAGAGAAATTGACAATTACAC 
AAAC^TAATATACACCTTAATTGAAGAATCGCAGAACCAACAAGAAAAAAATGAACTAG 
AATTATTGGAATTGGATAAATGGGCAAATTTGTGGAATTGGTTTAGTATATCAAACTGG 
CTATGGTATATAAAATTATTCATAATGGTAGTAGGAGGCTTGGTAGGTTTAAGAATAGT 

10 TTTTACTGTACTTTCTATAGTGAATAGAGTTAGGCAGGGATACTCACCATTGTCGTTTC 
AGACCCACCTCCCAACCCCGAAGGGACCCGACAGGCCAGAAGGAATCGAAGAAGAAGGT 
GGAGAGAGAGACAGAGGCAGCTCCACTCGATTAGTGCACGGATTCTTAGCACTTTTCTG 
GGACGACCTGAGGAGTCTGTGCCTCTTCAGCTACCACCACTTGAGAGACTTACTCrTGA 
TTGTAACGAGGATTGTGGAACTTCTGGGACGCAGGGGATGGGAAGCCCTCAAATACTGG 

1 5 TGGAATCTCCTGCAGTATT^ 

TGGCACGGCCATAGCAGTAGCTGAGGGGACAGATAGAG^^ 
CTTATAGAGCTATTCTCC^^ 

CTATAAAATGGGTGGCAAGTGGTCAGAAAGTAGTGTGGTTAGAAGGCATGTACCTTTAA 
GACAAGCK3AGCTATAGATCTTAGCCGCT 

20 AATTCACTCACAGAGAAGATCAGTTGAAC 

AAAACAACAGATTGTTCCGTTTGTTCCGTTGGGGACTTTCCAGGAGACGTGGCCTGAGT 
GATAAGCCGCTGGGGACTTTCCGAAGAGGCGTGACGGGACTTTCCAAGGCGACGTGGCC 
TGGGCGGGACTGGGGAGTGGCGAGCCCTCAGATGCTGCATATAAGCAGCTGCTTTCTC 
CTGTACTGGGTCTCTCTGGTTAGACCAGATCTGAGCCTGGGAGCTCTCTGGCTAACTAG 

25 GGAACCCACTGCTTAAGCCTCAATAAAGCTTGCCTTGAGTGCTTCAAGTAGTGTGTGCC 
CGTCTGTTGTGTGACTCTGGTATCTAGA . 

The present invention, however, extends to HTV-1 isolates which are non-pathogenic; 
carry genomes capable of hybridising under low stringency conditions to SEQ ID NO: 
30 614 or SEQ ID NO: 615; and which do not direct the synthesis of a full length nef gene 
product. 

In a further embodiment the present invention contemplates a viral isolate which: 

(i) is reactive to antibodies to a glycoprotein from HIV-1 such as at least one of 
35 gp41-45, gpl20 and/or gpl60; 

(ii) carries a genome or a part or fragment thereof capable of hybridising under 
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medium stringency conditions to a nucleotide sequence as set forth in SEQ ID 
NO: 1 or a complementary form thereof or an analogous sequence from another 
pathogenic strain of HIV- 1; 

(iii) carries a deletion of at least ten nucleotides in a region corresponding to the LTR 
5 region in HIV-1 NL4-3; and 

wherein said deletion encompasses one or more of the following decanucleotides from 

the LTR region of HTV-1 NL4-3 or corresponding sequences from another pathogenic 

strain of HIV-1: 



10 


GCTTTTTGCC 


(SEQ 


ID 


NO: 


652) ; 


CTTTTTGCCT 


(SEQ 


ID 


NO: 


653) 




TTTTTGCCTG 


(SEQ 


ID 


NO: 


654) ; 


TTTTGCCTGT 


(SEQ 


ID 


NO: 


655) 




TTTGCCTGTA 


(SEQ 


ID 


NO: 


656) ; 


TTGCCTGTAC 


(SEQ 


ID 


NO: 


657) 




TGCCTGTACT 


(SEQ 


ID 


NO: 


658) ; 


GCCTGTACTG 


(SEQ 


ID 


NO: 


659) 




CCTGTACTGG 


(SEQ 


ID 


NO: 


660) ; 


CTGTACTGGG 


(SEQ 


ID 


NO: 


661) 


15 


TGTACTGGGT 


(SEQ 


ID 


NO: 


662) ; 


GTACTGGGTC 


(SEQ 


ID 


NO: 


663) 




TACTGGGTCT 


(SEQ 


ID 


NO: 


664) ; 


ACTGGGTCTC 


(SEQ 


ID 


NO: 


665) 




CTGGGTCTCT 


(SEQ 


ID 


NO: 


666) ; 


TGGGTCTCTC 


(SEQ 


ID 


NO: 


667) 




GGGTCTCTCT 


(SEQ 


ID 


NO: 


668) ; 


GGTCTCTCTG 


(SEQ 


ID 


NO: 


669) 




GTCTCTCTGG 


(SEQ 


ID 


NO: 


670) ; 


TCTCTCTGGT 


(SEQ 


ID 


NO: 


671) 


20 


CTCTCTGGTT 


(SEQ 


ID 


NO: 


672) / 


TCTCTGGTTA 


(SEQ 


ID 


NO: 


673) 




CTCTGGTTAG 


(SEQ 


ID 


NO: 


674) ; 


TCTCTGGTTA 


(SEQ 


ID 


NO: 


675) 




CTGGTTAGAC 


(SEQ 


ID 


NO: 


676) ; 


TGGTTAGACC 


(SEQ 


ID 


NO: 


677) 




GGTTAGACCA 


(SEQ 


ID 


NO: 


678) ; 


GTTAGACCAG 


(SEQ 


ID 


NO: 


679) 




TTAGACCAGA 


(SEQ 


ID 


NO: 


680) ; 


TAGACCAGAT 


(SEQ 


ID 


NO: 


681) 


25 


AGACCAGATC 


(SEQ 


ID 


NO: 


682) ; 


GACCAGATCT 


(SEQ 


ID 


NO: 


683) 




ACCAGATCTG 


(SEQ 


ID 


NO: 


684) ; 


CCAGATCTGA 


(SEQ 


ID 


NO: 


685) 




CAGATCTGAG 


(SEQ 


ID 


NO: 


686) ; 


AGATCTGAGC 


(SEQ 


ID 


NO: 


687) 




GATCTGAGCC 


(SEQ 


ID 


NO: 


688) ; 


ATCTGAGCCT 


(SEQ 


ID 


NO: 


689) 




TCTGAGCCTG 


(SEQ 


ID 


NO: 


690) ; 


CTGAGCCTGG 


(SEQ 


ID 


NO: 


691) 


30 


TGAGCCTGGG 


(SEQ 


ID 


NO: 


692) ; 


GAGCCTGGGA 


(SEQ 


ID 


NO: 


693) 




AGCCTGGGAG 


(SEQ 


ID 


NO: 


694) ; 


GCCTGGGAGC 


(SEQ 


ID 


NO: 


695) 




CCTGGGAGCT 


(SEQ 


ID 


NO: 


696) ; 


CTGGGAGCTC 


(SEQ 


ID 


NO: 


697) 




TGGGAGCTCT 


(SEQ 


ID 


NO: 


698) ; 


GGGAGCTCTC 


(SEQ 


ID 


NO: 


699) 




GGAGCTCTCT 


(SEQ 


ID 


NO: 


700) ; 


GAGCTCTCTG 


(SEQ 


ID 


NO: 


701) 


35 


AGCTCTCTGG 


(SEQ 


ID 


NO: 


702) ; 


GCTCTCTGGC 


(SEQ 


ID 


NO: 


703) 




CTCTCTGGCT 


(SEQ 


ID 


NO: 


704) ; 


TCTCTGGCTA 


(SEQ 


ID 


NO: 


705) 
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CTCTGGCTAA (SEQ ID NO: 706) 
CTGGCTAACT (SEQ ID NO: 708) 
GGCTAACTAG (SEQ ID NO: 710) 
CTAACTAGGG (SEQ ID NO: 712) 
5 AACTAGGGAA (SEQ ID NO: 714) 
CTAGGGAACC (SEQ ID NO: 716) 
AGGGAACCCA (SEQ ID NO: 718) 
GGAACCCACT (SEQ ID NO: 720) 
AACCCACTGC (SEQ ID NO: 722) 
10 CCCACTGCTT (SEQ ID NO: 724) 
CACTGCTTAA (SEQ ID NO: 726) 
CTGCTTAAGC (SEQ ID NO: 728) 
GCTTAAGCCT (SEQ ID NO: 730) 
TTAAGCCTCA (SEQ ID NO: 732) 
15 AAGCCTCAAT (SEQ ID NO: 734) 
GCCTCAATAA (SEQ ID NO: 73(5) 
CTCAATAAAG (SEQ ID NO: 738) 
CAATAAAGCT (SEQ ID NO: 740) 
ATAAAGCTTG (SEQ ID NO: 742) 
20 AAAGCTTGCC (SEQ ID NO: 744) 
AGCTTGCCTT (SEQ ID NO: 746) 
CTTGCCTTGA (SEQ ID NO: 748) 
TGCCTTGAGT (SEQ ID NO: 750) 
CCTTGAGTGC (SEQ ID NO: 752) 
25 TTGAGTGCTT (SEQ ID NO: 754) 
GAGTGCTTCA (SEQ ID NO: 756) 
GTGCTTCAAG (SEQ ID NO: 758) 
GCTTCAAGTA (SEQ ID NO: 760), 
TTCAAGTAGT (SEQ ID NO: 762) , 
30 CAAGTAGTGT (SEQ ID NO: 764) ; 
AGTAGTGTGT (SEQ ID NO: 766) ; 
TAGTGTGTGC (SEQ ID NO: 768) ; 
GTGTGTGCCC (SEQ ID NO: 770) ; 
GTGTGCCCGT (SEQ ID NO: 772); 
35 GTGCCCGTCT (SEQ ID NO: 774); 
GCCCGTCTGT (SEQ ID NO: 776); 
CCGTCTGTTG (SEQ ID NO: 778); 
GTCTGTTGTG (SEQ ID NO: 780); 
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TCTGGCTAAC 


(SEQ 


ID 


NO: 


707) 


TGGCTAACTA 


(SEQ 


ID 


NO: 


709) 


GCTAACTAGG 


(SEQ 


ID 


NO: 


711) 


TAACTAGGGA 


(SEQ 


ID 


NO: 


713) 


ACTAGGGAAC 


(SEQ 


ID 


NO: 


715) 


TAGGGAACCC 


(SEQ 


ID 


NO: 


717) 


GGGAACCCAC 


(SEQ 


ID 


NO: 


719) 


GAACCCACTG 


(SEQ 


ID 


NO: 


721) 


ACCCACTGCT 


(SEQ 


ID 


NO: 


723) 


CCACTGCTTA 


(SEQ 


ID 


NO: 


725) 


ACTGCTTAAG 


(SEQ 


ID 


NO: 


727) 


TGCTTAAGCC 


(SEQ 


ID 


NO: 


729) 


CTTAAGCCTC 


(SEQ 


ID 


NO: 


731) 


TAAGCCTCAA 


(SEQ 


ID 


NO: 


733) 


AGCCTCAATA 


(SEQ 


ID 


NO: 


735) 


CCTCAATAAA 


(SEQ 


ID 


NO: 


737) 


TCAATAAAGC 


(SEQ 


ID 


NO: 


739) 


AATAAAGCTT 


(SEQ 


ID 


NO: 


741) 


TAAAGCTTGC 


(SEQ 


ID 


NO: 


743) 


AAGCTTGCCT 


(SEQ 


ID 


NO: 


745) 


GCTTGCCTTG 


(SEQ 


ID 


NO: 


747) 


TTGCCTTGAG 


(SEQ 


ID 


NO: 


749) 


GCCTTGAGTG 


(SEQ 


ID 


NO: 


751) 


CTTGAGTGCT 


(SEQ 


ID 


NO: 


753) 


TGAGTGCTTC 


(SEQ 


ID 


NO: 


755) 


AGTGCTTCAA 


(SEQ 


ID 


NO: 


757) 


TGCTTCAAGT 


(SEQ 


ID 


NO: 


759) 


CTTCAAGTAG 


(SEQ 


ID 


NO: 


761) 


TCAAGTAGTG 


(SEQ 


ID 


NO: 


763) 


AAGTAGTGTG 


(SEQ 


ID 


NO: 


765) , 


GTAGTGTGTG 


(SEQ 


ID 


NO: 


767) , 


AGTGTGTGCC 


(SEQ 


ID 


NO: 


769) ( 


TGTGTGCCCG 


(SEQ 


ID 


NO: 


771) , 


TGTGCCCGTC 


(SEQ 


ID 


NO: 


773) , 


TGCCCGTCTG 


(SEQ 


ID 


NO: 


775) , 


CCCGTCTGTT 


(SEQ 


ID 


NO: 


777) , 


CGTCTGTTGT 


(SEQ 


ID 


NO: 


779) , 


TCTGTTGTGT 


(SEQ 


ID 


NO: 


781) , 
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CTGTTGTGTG (SEQ ID NO: 782); TGTTGTGTGA (SEQ ID NO: 783) 

GTTGTGTGAC (SEQ ID NO: 784); TTGTGTGACT (SEQ ID NO: 785) 

TGTGTGACTC (SEQ ID NO: 786); GTGTGACTCT (SEQ ID NO: 787) 

TGTGTGACTC (SEQ ID NO: 788); GTGTGACTCT (SEQ ID NO: 789) 

TGTGACTCTG (SEQ ID NO: 790); GTGACTCTGG (SEQ ID NO: 791) 

TGACTCTGGT (SEQ ID NO: 792); GACTCTGGTA (SEQ ID NO: 793) 

ACTCTGGTAA (SEQ ID NO: 794); CTCTGGTAAC (SEQ ID NO: 795) 

TCTGGTAACT (SEQ ID NO: 796); CTGGTAACTA (SEQ ID NO: 797) 

TGGTAACTAG (SEQ ID NO: 798); GGTAACTAGA (SEQ ID NO: 799) 



The non-pathogenic isolate may carry a single decanucleotide deletion in the LTR region 
or may carry multiple deletions in the same region or in the LTR region and another 
region such as the we/gene. In particular, the mutation may be in the LTR/w/overlap 
region. Where it carries multiple deletions, these may correspond to a contiguous 
15 sequence or be from different parts of the LTR region and/or w/gene. Furthermore, 
the terminal end portions of a deletion may lie within a decanucleotide as defined above. 

Yet another aspect of the present invention provides an infectious molecular clone 
comprising genetic sequences derived from the non-pathogenic HIV-I isolates as 
20 hereinbefore described and includes genetic sequences encoding major structural proteins 
such as gag em and poL Infectious molecular clones are particularly useful as genetic 
compositions capable of "infecting" host cells without need of viral coat. The infectious 
molecular clones of the present invention may also be derived from pathogenic HIV-1 
strains rendered non-pathogenic as hereindescribed. 



According to this latter embodiment, there is contemplated a method of attenuating a 
pathogenic strain of HIV-1, said method comprising inducing a mutation in the we/gene 
and/or an LTR region to generate a non-pathogenic HIV-1 strain as hereinbefore 
described. Preferred mutations are deletions of at least ten nucleotides such as one or 
more of the decanucleotides as hereinbefore described. The mutations may also 
constitute substitutions and/or insertions of heterologous nucleic acid molecules in the 
nef and/or LTR regions such as the incorporation of a sense (i.e. co-suppression) 
antisense or ribozyme molecule (see below). 
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In still yet another embodiment of the present invention, there is provided an isolated, 
non-pathogenic strain of HIV-1 comprising a deletion in its genome of at least 10 
nucleotide within the region of nucleotide 8787 and 9709 and more particularly within 
the region 8800 and 9700 and even more preferably within the region 8800 and 9410, 
5 using the nucleotide numbering of HIV-1 NL4-3. 

In one embodiment, the deletion of at least 10 nucleotides is from within a region 
selected from the list consisting of (using the nucleotide numbers of HIV-1 NL4-3): 
nucleotide (i) 8830-8862; 
10 (ii) 9009-9035; 

(Hi) 9019-9029; and 

(iv) 9033-9049. 

In another embodiment, the deletion of at least 10 nucleotides is from within a region 
15 selected from the list consisting of (using the nucleotide numbers of HIV-1 NL4-3): 
nucleotide (v) 9281-9371; 

(vi) 9281-9362; 

(vii) 9105-9224; and 

(viii) 9271-9370. 

20 

In yet another embodiment, the deletion of at least 10 nucleotides is from within a 
region selected from the list consisting of (using the nucleotide numbers of HIV-1 NL4- 
3): 

nucleotide (ix) 8882-8928; 
25 (x) 8850-9006; 

(xi) 8792-9041; and 

(xii) 9112-9204. 



30 



In still yet another embodiment, the deletion of at least 10 nucleotides is from within a 
region selected from the list consisting of (using the nucleotide numbers of HTV-1 NL4- 

3): 
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nucleotide (xiii) 
(xiv) 
(xv) 



9281-9366. 



9105-9224; 
9389-9395; and 



5 

The above embodiments including any combinations thereof or functional equivalents 
thereof. Most preferred variants of HIV-1 are defined in Table 3. 

Reference herein to the non-pathogenic HIV-1 strains of the present invention includes 
1 0 reference to components, parts, fragments and derivatives thereof including both genetic 
and non-genetic material. Furthermore, the non-pathogenic HIV-1 strains may be in 
isolated form or resident in peripheral blood mononuclear cells (PBMCs) or other like 
cells where the genome of the HTV-1 strains is integrated as DNA from said HIV-1 
strains such as proviral DNA. In addition, the present invention extends to recombinant 
15 virus such as from (or resident in) prokaryotes or eukaiyotes as well as in the form of 
infectious molecular clones. 

Accordingly, the present invention provides for the non-pathoge;nic HIV-1 isolate, 
genomic material therefrom, complementary proviral DNA, molecular infectious clones, 
20 recombinant viral particles or genetic sequences therefrom or cells expressing same or 
blood cells carrying proviral DNA or to any mutants, derivatives, components, 
fragments, parts, homologies or analogues of the foregoing. 

The non-pathogenic HIV-1 strains of the present invention are particularly useful in the 
25 development of therapeutic compositions, therapeutic molecules and/or diagnostic 
reagents. With regards to the former, the non-pathogenic HIV-1 strain may be 
considered as a live attenuated vaccine where individuals carrying DNA derived from 
said non-pathogenic HIV-1 strain such as proviral DNA in target cells are protected from 
infection by a corresponding pathogenic strain. The term "vaccine" is used in its broadest 
30 sense as a therapeutic composition or molecule which prevents or reduces HIV-1 
infection or risk of infection or which ameliorates the symptoms of infection. It may 
involve the stimulation of an immune response or may involve blocking HIV-1 cells 
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receptors and/or the use of genetic compositions, for example, to introduce ribozymes 
or antisense molecules to HIV-1 directed genetic sequences or to prepare infectious 
molecular clones. For convenience, all such compositions are referred hereinafter to 
"therapeutic compositions' 1 . 

5 

Accordingly, the present invention contemplates a method for inhibiting or reducing the 
risk of infection by a pathogenic strain of HTV-1, said method comprising administering 
to a subject a non-pathogenic HIV-1 as hereinbefore defined in an amount effective to 
infect target cells and to generate target cells carrying proviral DNA from said non- 
10 pathogenic HTV-1 . More particularly, the present invention contemplates a method for 
inhibiting or reducing productive infection of an individual by a pathogenic strain of 
HIV-1, said method comprising administering to a subject a non-pathogenic isolate of 
HTV-1 in an amount effective to infect target cells and to generate target cells carrying 
proviral DNA from said non-pathogenic HIV-1. By "productive infection" as used in 
15 the specification and claims herein is meant the infection of a cell or cells by a 
pathogenic strain of HTV-1 which leads ultimately to the symptomology of AIDS or 
AIDS related diseases. A cell infected productively produces pathogenic virions. By 
definition, infection of an individual by a non-pathogenic strain of HTV-1 would not lead 
to productive infection. Non pathogenic HTV-1 strains generally replicate to a sufficient 
20 extent to protect cells against challenge by virulent or pathogenic strains. The methods 
of the present invention are applicable prophylactically (Le. to prevent de novo infection) 
or therapeutically (i.e. to reduce or slow disease progression). 

The present invention further provides a method for vaccinating an individual against 
25 the development of AIDS or AIDS related diseases, said method comprising 
administering to said individual a non-pathogenic isolate of HTV-1 in an amount 
effective to infect target cells and to generate target cells carrying proviral DNA from 
said non-pathogenic HIV-1. The term "vaccinating" should not be taken as limiting the 
invention to the prevention of HIV-1 infection by solely immunological means. The 
30 term "vaccinating" includes any means of preventing productive infection of an 
individual by pathogenic HTV-1. 
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As an alternative to the above methods, a therapeutic composition as hereinbefore 
defined is administered. The non-pathogenic isolate may be administered inter alia as 
an isolated viral preparation or via infected blood cells. Another aspect of the invention 
provides a therapeutic composition for inhibiting or reducing the risk of infection by a 
5 pathogenic strain of HIV-1 said therapeutic composition comprising a non-pathogenic 
strain of HIV-1 or genetic sequences derived therefrom as hereinbefore described and 
optionally one or more pharmaceutical^ acceptable carriers and/or diluents. The 
therapeutic composition of the present invention is generally suitable for intravenous, 
intraperitoneal, intramuscular, intramucosal (e.g. nasal spray, respiratory spray) or other 

10 forms of parenteral administration. The therapeutic composition might also be 
administered via an implant or rectally or orally. In addition to the mutations 
contemplated above, the non-pathogenic HIV-1 strain may also contain one or more 
other mutations to further reduce the risk of reversion to virulence and/or to insert a 
genetic sequence capable of providing directly or indirectly an identifiable signal, having 

15 ftiirther anti-HTV-1 properties and/or immunostimulatory or cell regulatory properties. 

For example, the non-pathogenic HIV-1 isolate in the therapeutic composition may 
comprise additional genetic material capable of directing expression of antisense 
nucleotide sequences to inhibit expression of one or more proteins encoded by a 
20 pathogenic strain of HIV-1. Alternatively, sense co-suppression may be employed. 
Preferred sense or antisense molecules would reduce expression of the nef gene or affect 
normal functioning of the LTR region. 

According to this embodiment, the non-pathogenic HIV-1 strain may be considered as 
25 a targeting agent to introduce genetic constructs capable of reducing expression of one 
or more HIV-1 proteins or polypeptides. In this embodiment there is provided a viral 
isolate which: 

(i) is genetically or immunologically related to a pathogenic strain of HIV-1; 

(ii) is substantially non-pathogenic in human subjects; 

30 (iii) comprises a first nucleotide sequence constituting its genome which is capable 
of hybridising under medium stringency conditions to SEQ ID NO: 1 or a 
complementary form thereof; and 
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(iv) comprises a second nucleotide sequence within said first nucleotide sequence and 
which second nucleotide sequence directs expression of a mRNA molecule 
capable of inhibiting, reducing or otherwise down-regulating translation of a 
protein or polypeptide encoded by a pathogenic strain of HIV-1 or inhibit, reduce 
5 or otherwise down regulate operation of a non-protein encoding a region of a 

pathogenic strain of HIV-1. 



Preferred proteins or polypeptides targeted for reduced expression are those encoded by 
one or more of the following: gag pol, env, tat, rev, vpu t vpr, w/and/or He/genes. The 

10 most preferred protein or polypeptide targeted for reduced expression is the product of 
the nef gene. Alternatively, the target nucleotide sequence does not encode a 
polypeptide or protein but is required for other functions such as integration or excision 
from the human genome or expression of genes on the viral genome. An example of 
such a nucleotide sequence is the LTR region. Accordingly, the present invention 

15 extends to disruption to the function of such regions. 

The above aspect relates to use of antisense technology. The present invention extends, 
however, to use of ribozymes and/or co-suppression to achieve the same results. In an 
alternative embodiment, or in addition to, the second (or optionally a third) nucleotide 
20 sequence encodes an antiviral agent (e.g. interferon) and/or an immune enhancing agent. 

The identification of deletions inter alia in the nef gene and/or LTR region in 
asymptomatic subjects provides a unique opportunity to study the in vivo effects of 
attenuated HTV-1 strains carrying one or more mutations in selected genetic regions. In 

25 particular, the present invention provides a means for designing therapeutic compositions 
directed to inhibiting expression of a nef gene and/or LTR region in a pathogenic HTV-1 
strain (such as contemplated above) as well as developing a therapeutic regimem aimed 
at inhibiting the activity of the nef gene product for the function of the LTR region. 
According to this latter embodiment, the present invention provides a therapeutic 

30 composition comprising a molecule capable of inhibiting the intracellular activity of the 
nef gene product and/or LTR region, said composition further comprising one or more 
pharmaceutically acceptable carriers and/or diluents. 
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The molecule contemplated by the above aspect of the subject invention may be a 
protein, polypeptide, peptide, chemical compound, sugar moiety or derivative of the nef 
gene product. The molecule will need to be capable of entering an infected cell. Where 
the molecule is a protein, polypeptide or peptide, it may be encoded by the second 
5 nucleotide sequence on the targeting vector as contemplated above. Alternatively, the 
molecule may be a nucleic acid molecule capable of targeting the nef gene or LTR 
region. 

The deletion mutants of the present invention may result in a modified nef gene product 
10 either having no readily discernable activity or having activity different to the naturally 
occurring nef protein. In any event, if a mutant nef gene product is produced, it will 
generally have a lower molecular weight than the naturally occurring nef protein and will 
have a different overall amino acid sequence. This provides, therefore, for a means for 
diagnosing individuals with benign HTV-1 infection by, for example, assaying for a 
15 modified nef protein or screening for a modified nef gene sequence. Alternatively, 
benign HTV-1 infection may be detected by assaying for a modified LTR region such 
as an altered nucleotide sequence. 

According to one embodiment, there is provided a method for determiiaing the 
20 pathogenicity of an HIV-1 strain after said HTV-1 strain infects cells of an individual, 
said method comprising contacting a biological sample from said individual with an 
effective amount of an antibody specific to a nef protein from a non-pathogenic strain 
of HTV-1 (as hereinbefore defined) for a time and under conditions sufficient to form 
an antibody-modified nef protein complex and then detecting said complex. The 
25 presence of said complex is indicative of a modified nef gene product and of the non- 
pathogenicity of the strain of HIV-1. The biological sample is a sample likely to contain 
the modified nef gene product such as tissue extract or cell extract of an infected cell. 
However, where the modified nef gene product is capable of permeation or transport out 
of the cell, suitable biological fluid would include serum, whole blood, lymph and 
30 mucosal secretion amongst other fluids. Many variations in the subject assay are 
possible and are contemplated herein. For example, an assay could be based on the 
inability for a nef specific antibody to bind to a modified nef protein. For the purposes 



WO 95/21912 PCT/AU95/00063 

-36- 

of the present invention the term "contacting" including "mixing". 



The presence of a modified nef molecule in biological fluid can be detected using a wide 
range of immunoassay techniques such as those described in US Patent Nos. 4,016,043, 
5 4,424,279 and 4,018,653. These include both single-site and two-site, or "sandwich", 
assays of the non-competitive types, as well as in the traditional competitive binding 
assays and include ELISA and RIA techniques. Sandwich assays are among the most 
useful and commonly used assays and are favoured for use in the present invention. A 
number of variations of the sandwich assay technique exist, and all are intended to be 

10 encompassed by the present invention. Briefly, and by way of example only, in a 
typical forward assay, a modified nef product-specific antibody is immobilised onto a 
solid substrate to form a first complex and the sample to be tested for modified nef 
product brought into contact with the bound molecule. After a suitable period of 
incubation, for a period of time sufficient to allow formation of an antibody-modified 

15 nef product secondary complex, a second modified nef protein antibody, labelled with 
a reporter molecule capable of producing a detectable signal, is then added and 
incubated, allowing time sufficient for the formation of a tertiary complex of antibody- 
modified nef product-antibody. Any unreacted material is washed away, and the 
presence of the tertiary complex is determined by observation of a signal produced by 

20 the reporter molecule. The results may either be qualitative, by simple observation of 
the visible signal or may be quantitated by comparison with a control sample containing 
known amounts of hapten. Variations of the forward assay include a simultaneous assay, 
in which both sample and labelled antibody are added simultaneously to the bound 
antibody, or a reverse assay in which the labelled antibody and sample to be tested are 

25 first combined, incubated and then added simultaneously to the bound antibody. These 
techniques are well known to those skilled in the art, and the possibility of minor 
variations will be readily apparent. The antibodies used above may be monoclonal or 
polyclonal. 

30 The solid substrate is typically glass or a polymer, the most commonly used polymers 
being cellulose, polyacrylamide, nylon, polystyrene, polyvinyl chloride or polypropylene. 
The solid supports may be in the form of tubes, beads, discs or microplates, or any other 
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surface suitable for conducting an immunoassay. The binding processes are well-known 
in the art and generally consist of cross-linking covalently binding or physically 
adsorbing the molecule to the insoluble carrier. 

5 By "reporter molecule", as used in the present specification, is meant a molecule which, 
by its chemical nature, produces an analytically identifiable signal which allows the 
detection of antigen-bound antibody. Detection may be either qualitative or quantitative. 
The most commonly used reporter molecule in this type of assay re either enzymes, 
fluorophores or radionuclide containing molecules (i.e. radioisotopes). In the case of an 

10 enzyme immunoassay, an enzyme is conjugated to the second antibody, generally by 
means of glutaraldehyde or periodate. As will be readily recognised, however, a wide 
variety of different conjugation techniques exist which are readily available to one 
skilled in the art. Commonly used enzymes include horseradish peroxidase, glucose 
oxidase, P-galaictosidase and alkaline phosphatase, amongst others. The substrates to be 

15 used with the specific enzymes are generally chosen for the production,upon hydrolysis 
by the corresponding enzyme, of a detectable colour change. It is also possible to 
employ fluorogenic substrates, which yield a fluorescent product. 

Alternatively, fluorescent compounds, such as fluorescein and rhodamine, may be 
20 chemically coupled to antibodies without altering their binding capacity. When activated 
by illumination with light of a particular wavelength, the fluorochrome-labelled antibody 
adsorbs the light energy, inducing a state of excitability in the molecule, followed by 
emission of the light at a characteristic colour visually detectable with a light 
microscope. As in the EIA, the fluorescent labelled antibody is allowed to bind to the 
25 first antibody-hapten complex. After washing off the unbound reagent, the remaining 
complex is then exposed to the light of the appropriate wavelength, the fluorescence 
observed indicates the presence of the hapten of interest. Immunofluorescence and EIA 
techniques are both very well established in the art and are particularly preferred for the 
present method. However, other reporter molecules, such as radioisotope, 
30 chemiluminescent or bioluminescent molecules, may also be employed. It will be 
readily apparent to the skilled technician how to vary the procedure to suit the required 
purpose. It will also be apparent that the foregoing can be used to label a modified nef 
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product and to use same directly in the detection of, for example, circulatory antibodies 
specific to said modified nef product. 

Alternatively, genetic assays may be conducted to screen for abberations in the nef gene 
5 and/or LTR region. Such a genetic assay may be by Southern or Northern blot analysis, 
PGR analysis or the like using oligonucleotides specific to a deleted region of a nef gene 
and/or LTR region. 

According to this embodiment there is provided a method for determining the 
10 pathogenicity of an HIV-1 strain after said HIV-1 strain infects cells of an individual, 
said method comprising determining directly or indirectly the presence of a deletion 
mutation in the genome of said HIV-1 wherein the presence of a such a mutation is 
indicative of the presence of a non-pathogenic strain of HTV-1. The deletion mutation 
may result in the genome being unable to synthesise a polypeptide or protein from a 
15 pathogenic strain of HIV-1 or may direct the synthesis of a truncated form of said 
polypeptide or protein. The mutation may also lead to altered expression of a 
polypeptide detectable by, for example, decreased synthesis of a particular protein, such 
as the nef gene product. Alternatively, the deletion mutation affects the LTR region or 
a regulatory region of the HTV-1 genome. In either case, affected viruses may also be 
20 detected by, for example, observing low viral copy numbers such as low viral loads. 

Preferably, said non-pathogenic HTV-1 carries a deletion in its genome of at least 10 
nucleotides within the region from nucleotide 8787 to nucleotide 9709 using the 
nucleotide numbering of HTV-1 strain NL4-3. 



Preferably, said non-pathogenic HTV-1 carries a deletion in its genome of at least 10 
nucleotides from within a region selected from the list consisting of: 



25 



nucleotide (i) 



8830-8862; 
9009-9035; 
9019-9029; and 



30 



00 
(Hi) 
(iv) 



9033-9049. 
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Preferably, said non-pathogenic HTV-1 carries a deletion in its genome of at least 10 
nucleotides from within a region selected from the list consisting of: 
nucleotide (v) 9281-9371; 

(vi) 9281-9362; 
5 (vii) 9105-9224; and 

(viii) 9271-9370. 

Preferably, said non-pathogenic HIV-1 carries a deletion in its genome of at least 10 
nucleotides from within a region selected from the list consisting of: 
10 nucleotide (ix) 8882-8928; 

(x) 8850-9006; 

(xi) 8792-9041; and 

(xii) 9112-9204. 

15 Preferably, said non-pathogenic HIV-1 carries a deletion in its genome of at least 10 
nucleotides from within a region selected from the list consisting of: 
nucleotide (xiii) 9105-9224; 

(xiv) 9389-9395; and 

(xv) 9281-9366. 

20 

The above nucleotide numbers are based on the nucleotide numbering in the NL4-3 
genome. 

Particularly preferred oligonucleotides are based on the deleted regions of the ne/gene 
25 and/or LTR region such as but not limited to one or more oligonucleotides based on 
SEQ ID NO; 2 to SEQ ID NO: 613 and/or SEQ ID NO: 652 to SEQ ID NO: 799. 

The present invention further extends to kits for the diagnosis of infection by pathogenic 
strains of HIV-1 or for determining the pathogenicity of infecting virus. The kits would 
30 be in compartmental form each comprising one or more suitable reagents for conducting 
the assay. 



WO 95/21912 PCT/AU95/00063 

-40- 

The present invention is further described by the following non-limiting Figures and 
Examples. 

In the Figures: 

5 

Figure 1 is a representation showing the alignment of the nucleotide sequences from 
donor D36 peripheral blood mononuclear cell (PBMC) [D36P] and non-pathogenic HTV- 
1 from recipient C18 HIV StV [C18S], C18 HIV^c [C18M] and C98 fflV [C98H] and 
C54 PBMC [C54P] with the equivalent region of HIV-1 NL4-3. Sequences labelled 

10 PBMC are from patient PBMC, those labelled HTV are from virus isolated from patient 
PBMC and grown in culture. Numbering for HIV-1 NL4-3 is as per Myer et al (1992) 
where nucleotide 1 is the first nucleotide of the complete proviral DNA sequence. 
D36P, C18S, C18M, C98H and C54P are numbered from the start of the region 
sequenced. Identity with NL4-3 sequence is shown by (*). Deleted nucleotides are 

15 shown by (-). Spaces introduced to maximise alignment are shown by Q. Features in 
NL4-3 are marked by overlining the sequence, features in D36 and C18 are marked by 
underlining the sequence. 

Figure 2 shows the alignment of encoded amino acid sequences of (a) tat exon 3 and 
20 (b) rev exon 3 from HTV-1 NL4-3, D36 PBMC, CI 8 HIV StV and C98 fflV. In-phase 
termination codons (*) and NL4-3 encoded amino acid numbers are shown. 

Figure 3 is a representation showing the alignment of C-terminal envelope glycoprotein 
gp41 amino acid sequences encoded by D36 PMBC, C18 HIV StV , C18 HIV MBC and 
25 C98 HTV. Numbering is that of the amino acid sequence of the mature envelope 
glycoprotein of HIV NL4-3. Termination in shown by (#). 

Figure 4 is a representation showing alignment of amino acid sequences encoded by the 
we/genes of HIV-1 NL4-3, D36 PBMC, C18 HTV StV , C18 fflV^ and C98 HIV. In 
30 phase termination codons are shown by (#). Identical amino acids are shown by (*). 
Residues underlined are those immediately before a deletion. 
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Figure 5 shows a duplication of NEKB and Spl sequences in D36 PBMC, CI 8 HIV StV , 
CI 8 HIVj^bc and C98 HIV demonstrated by alignment of their low homology region 
sequences with the NFiS-Spl region of HIV-1 NL4-3. Nucleotide numbering 
according to Figure 1 . Identity with NL4-3 sequence shown by (*) and NFKB and Spl 
5 sites in NL4-3 overlined. Position of neJILTR region sequence deletion shown by (a). 

Figure 6 is a graphical representation showing replication of CI 8 and C98 viral isolates 
and D36 PBMCs from asymptomatic patients in PHA-stimulated PBMCs. 



10 




Figure 7 is a graphical representation showing replication of viral isolates from 
15 asymptomatic patients in non-PHA stimulated PBMCs. 



•■■••A"" 1 228200 




C98 
C18 
Mock 



20 

228200 is an Australian isolate of HIV-1 described by Kieman, R. et al (1990), Its 
characteristics include being T cell trophic, with fast kinetics, high producer of HIV-1 
and/or SI phenotype. 

25 Figure 8 is a graphical representation of cell surface receptor expression for syncytia- 
inducing (SI)/ non-syncytia-inducing (NSI)/asymptomatic patient isolates. 

■ TfR 

9 IL-2R 

■ CD4 



30 
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228200 is defined in the legend to Figure 7. 243925 is a viral isolate of HTV-1 which 
is monocyte/macrophage trophic and exhibits NSI phenotype (Dr Karen Coats-Fryer, 
PhD thesis entitled "Viral determinants of HIV-1 syncytium formation", the University 
5 of Melbourne, Parkville, Victoria, Australia. 

Figure 9 is a representation of the nucleotide sequence of C18 HTV-lj^ (SEQ ID NO: 
800). 

1 0 Figure 10(a) - (g) are graphical representations showing clinical immunology of cohort; 
(a) CD3; (b)(i) CD4 (ii) CD4%; (c)(i) CD8; (ii) CD8%; (d) lymphocyte count; (e) 
CD4/CD8 ratio; (f) p-2-microgIobulin; and (g) Kaplan-Meier estimates of time to 
disease progression (Cox & Oakes, 1989). 

1 5 Figure 11 is a schematic representation of the deletion mutants of the present invention. 
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A summary of the SEQ ID Nos. used in the subject specification is shown below: 



5 


SEQ ID NO: 


DESCRIPTION 




1 


Nucleotide sequence of HTV-1 strain NL4-3 genome 




2-613 


Decanucleotides of nef gene from HIV-1 strain NL4-3 


10 


614 


Partial nucleotide sequence of D36 HTV-1 isolate 




615 


Partial nucleotide sequence of C18 HTV-1]^ isolate 




616-625 


PCR primers shown in Table 1 




626-633 


Sequence primers shown in Table 2 




634 


Amino acid residues 15-27 of HTV-1 NL4-3 we/protein 


15 


635 


HTV-1 NL4-3 tat exons (Figure 2) 




636 


HTV-1 D36 tat exons (Figure 2) 




637 


HTV-1 CIS tat exons (Figure 2) 




638 


HTV-1 NL4-3 rev exons (Figure 2) 




639 


HIV-1 D36 rev exons (Figure 2) 


20 


640 


HIV-1 C18 rev exons (Figure 2) 




641 


HIV-1 NL4-3 C-terminal of gp41 (Figure 3) 




642 


HIV-1 D36 C-terminal of gp41 (Figure 3) 




643 


HIV-1 CI 8 C-terminal of gp41 (Figure 3) 




644 


HIV-1 NL4-3 nef gene (Figure 4) 


25 


645 


HIV-1 D36 nef gene (Figure 4) 




646 


HTV-1 C18 ne/gene (Figure 4) 




647 


HTV-1 NL4-3 NFKB/SP1 sequence (Figure 5) 




648 


HTV-1 D36 NFKB/SP1 sequence (Figure 5) 




649 


HTV-1 C18 NFKB/SP1 sequence (Figure 5) 


30 


650 


Nucleotide sequence of nef gene from HIV-1 strain NL4-3 




651 


Nucleotide sequence of env and nef regions of NL4-3 




652-799 


Decanucleotides of LTR region from HTV-1 strain NL4-3 




800 


Nucleotide sequence of C18 Hiy-lj^gc 
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EXAMPLE 1 

Source Material 

For the purposes of the following examples, a non-pathogenic HIV-1 strain was isolated 
5 from a recipient of HIV-1 infected blood. The recipient is designated H C18 M . Other 
recipients are defined as "C54" and "C98". The donor is identified herein as "D36". 
The place of isolation may be indicated after the abbreviation of "HIV". For example, 
St Vincents Hospital, Sydney (HTV StV ) or Macfarlane Burnet Centre of Medical 
Research, Melbourne (HIV^^). 

10 

Exemplary viral isolates referred to herein as "CI 8" and "C98" were deposited at the 
PHLS Centre for Applied Microbiology and Research, European Collection of Animal 
Cell Cultures (ECACC), Division of Biologies, Porton Down, Salisbury, Wiltshire SP4 
OJG. C18 was deposited on 17 October, 1994 under Provisional Accession Number 
15 V94101706 and C98 was deposited on 31 October, 1994 under Provisional Accession 
Number V941031 169. 

Figure 1 1 is a summary of the deletion mutants of the present invention. 

20 Viruses were isolated by the following procedures: 

1. Infected peripheral blood mononuclear cells (PBMCs) were co-cultured with 
HTV-1 seronegative donor PBMCs. A convenient source of seronegaitve donor 
PBMCs is a blood bank. The supernatants and cells are harvested every 7 days 
and fresh medium added with CD8 depleted PBMCs. CD8 depletion promotes 

25 the ability to isolate HIV-1. The culture and procedure is continued for up to 

approximately 5 weeks; 

2. The infected PBMCs are purified from whole blood and these cells are cultured 
alone for up to 5 weeks. PMBCs alone are used because the virus is more likely 

30 to be monocytotropic. Fresh medium is added weekly and supernatant fluid is 

harvested at this time; 
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3. Supernatant fluids are harvested every approximately 7 days, fresh medium and 
fresh HIV-1 seronegative CD8 depleted PBMCs are added at this time; 

4. HIV-1 seronegative PBMCs are pretreated with with M-CSF for approximately 
5 72 hours prior to the addition of infected PBMCs. M-CSF has been shown to 

enhance HTV-1 replication in monocytes (Gendelman et al, 1988); or 

5. The supernatant fluid is harvested from the cultures of step 4 every 
approximately 7 days, fresh medium added together with HIV-1 seronegative 

10 stimulated CD8 depleted PBMCs. The virus is isolated from the infected 

PBMCs. 

EXAMPLE 2 

DNA Preparation and PCR Amplification 
15 Non-pathogenic HIV-1 (e.g. strain CI 8) infected peripheral blood mononuclear cells 
(PBMQ were harvested 4 days after infection of phytohaemagglutinin (PHA) stimulated 
HIV-1 negative donor PBMC cultured by the method of Neate et al (1987) and washed 
in phosphate buffered saline (PBS). PBMC from Donor D36 and Recipients CI 8, C54 
and C98 were prepared by Ficoll isopaque centrifugation of buffy coat cells and washed 
20 with PBS. 

Approximately 10 7 cells were lysed in 1ml lysis solution (0.45% v/v NP40, 0.45% v/v 
Tween 20, lOmM Tris-HCl pH 8.3, 40mM KC1 2.5 mM MgClj) and digested with 
60ng/ml proteinase K (Boehringer Mannheim) at 55°C for 1 hour followed by 100°C 
25 for 10 minutes. Lysates were stored at -20°C. 

All polymerase chain reaction (PCR) primers (Table 1) and sequencing primers (Table 
2) were synthesised using an Applied Biosystems model 391 DNA synthesiser using 
phosphoramidite chemistry. 

30 

Strict physical separation was maintained for sample, PCR reagent mix and PCR reaction 
preparations as well as amplification and analysis. Final reaction mixes (50fil) contained 
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2 nl neat or diluted cell lysate, 0.2yM each primer, 200mM dNTPs and 1.25 units Taq 
polymerase (Boehringer Mannheim) in PGR buffer, (lOmM Tris-HCl pH8.3, 50mM KC1, 
100ng/ml gelatine) adjusted to the optimum MgCl 2 concentration for the primer pair 
(1.5-3.0mM). Aliquoted reagent mix was overlaid with 50jxl mineral oil prior to 
5 addition of DNA template lysate. After template denaturation at 94°C for 3 min 
amplification was achieved with 30 cycles of 94°C, 1 min; 55°C, 1 min; 72°C, 2 mins. 
A final elongation reaction was conducted at 72°C for 7 minutes. For double PCR 
amplification 2 nl of first round product was added to the second reagent mix and 
amplified as before. 

10 

PCR amplified DNA was checked for quality, quantity and fragment size by agarose gel 
electrophoresis in Tris-Acetate-EDTA buffer (Sambrook et al, 1989) stained in ethidium 
bromide and viewed by UV transillumination. 

15 EXAMPLE 3 

DNA Sequence Analysis 
The DNA sequence of PCR amplified HIV-1 regions was determined by the 
dideoxynucleotide method (Sanger et al, 1 997) using Sequenase T7 polymerase (United 
States Biochemicals). 

20 

PCR amplified DNA was purified by PCR Magic prep resin chromatography (Promega). 
Approximately 2 to 7 \ig purified DNA plus lOng specific primer (Table 2) were 
denatured by boiling for 3 mins and snap frozen to -20°C. The initial labelling reaction 
was for 3 minutes at 22°C (room temperature) with 35 SdNTP (500Ci/mmol; Dupont) 

25 followed by dideoxynucleotide termination reactions at 37°C for 5 minutes. NP40, to 
0.45% v/v, was included in denaturation and reaction mixes (Bachman et al, 1990). 
Sequencing reaction products were denatured in formamide and resolved on a 6% w/v 
polyacrylamide gel containing 8M urea, fixed in 10% v/v acetic acid, 10% v/v methanol 
and dried. Following autoradiography on XK1 film (Kodak) the gel sequences were 

30 read assembled, translated to protein and aligned using the PC/GENE suite of programs 
(IntelliGenetics, USA). 
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TABLE 1 
PCR PRIMERS 



PRIMER 


SEQUENCE 1, 3 


POSITION 2 


CM 


TGGAAGGGCTAATTTGGT(61 6) 


1-18 


Cl-2 


ATCTTCCCTAAAAAATTAGCCTGTC(6 1 7) 


2099-2075 


LTR-3' 


AGGCTCAGATCTGGTCTAAC(61 8) 


9559-9540 


SK68 


AGCAGCAGGAAGCACTATGG(61 9) 


7786-7805 


Cl-6 


TGCTAGAGATTTTCCACAC(620) 


9709-9691 


KS-2 


AGTGAATAGAGTTAGGCAGG(62 1) 


8326-8345 


RT5'-v3 


GTAAGACAGTATGATCAGATA(622) 


2418-2438 


RT3'-v2 - 


TTGTAGGGAATTCCAAATTCC(623) 


4660-4640 


RT5'-v2 


CAGGATCCTACACCTGTCAACATAAT(624) 


2487-2506 


RT3'-vl 


GGGAATTCCTTATTCCTGCTTG(625) 


4655-4634 



1. 

2. 
3. 



Sequence is presented from 5' to 3' of the primer. 

Position is according to the numbering of HTV NL4-3 in Myers et al (1992). 
SEQ ID NOs are given in parentheses. 
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TABLE 2 
SEQUENCING PRIMERS 



PRIMER 


SEQUENCE 1 ' 3 


POSITION 2 


KS3 


CCAGAAGTTCCACAATCC(626) 


8570-8553 


KS4 


TTCTTCTAGGTATGTGGAG(627) 


8753-8735 


KS5 


AGTGAATTAGCCCTTCCAG(628) 


9093-9075 


KS6 


TGCTAGAGATTTTCCACAC(629) 


9709-9691 


SP2 


TGCTCTGGAAAACTCAT(630) 


8006-8022 


SP3 


CTTTCTATAGTGAATAGAG(63 1) 


8318-8336 


SP4 


TATTGGAGTCAGGAACT(632) 


8618-8634 


SPR 


GGTCTAACCAGAGAGAC(633) 


9547-9531 



15 1. Sequence is presented from 5' to 3' of the primer. 

2. Position is according to the numbering of HTV NL4-3 in Myers et al (1992). 

3. SEQ ID NOs are given in parentheses. 

EXAMPLE 4 

20 Cells and Cell Culture 

Peripheral blood was obtained from HIV-1 sero-negative volunteers and mononuclear 
cells prepared by centrifugation on a Ficoll/Hypaque density gradient (Peper et al, 1968). 
PBMC were activated with phytohemagglutinin (PHA; 10ng/10 6 cells) for 48 h at 37°C 
washed and then cultured in RPMI 1640 medium containing 10% v/v heat inactivated 

25 foetal calf serum, 15mM HEPES, 0.1% v/v sodium bicarbonate, 25^g/ml polybrene 
(Sigma), 10% v/v interleukin 2 (Boehringer Mannheim) and 1:1000 anti-interferon 
(Miles) (IL-2 medium). Non-PHA stimulated cells were prepared in a similar manner 
except they were cultured in medium lacking PHA and IL-2. 
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EXAMPLE 5 

Antipeptide-antisera 

Antibodies specific for HIV-1 Nef were raised against a peptide corresponding to the 
predicted amino acid residues 15-27 (AVRERMRRAEPAA SEQ ID NO: 634) of Nef 
5 encoded by the HIV-1 clone pNL4.3 (Kemp et al, 1988). The peptide was conjugated 
to keyhole limpet hemocyanin (KLH; Calbiochem, Behring Diagnostics, CA) via 
glutaraldehyde and this complex used to immunise sheep (O.Smg peptide 
conjugate/sheep). Antibodies to the peptide were purified by affinity chromatography. 
Reactivity of the antibodies with recombinant HIV-1 Nef 25 and 27 was demonstrated 
10 by immunoblotting. 

EXAMPLE 6 

Reactivity of anti-Nef w ^ C18-infected Cells in Immunoblotting 
Seven days post-infection HIV-1 C18-infected PBMCs and mock - infected cells were 

15 washed in PBS then lysed (0.5% w/v NP-40, 0.5% w/v sodium deoxycholate, 50mM 
NaCl, 25mM Tris-HCl, lOmM EDTA, 0.01% w/v sodium azide and lOmM 
phenylmethylsulphonylfluoride). After nuclei were spun out lysates were 
electrophoresed in a 13% w/v SDS-polyacrylamide gel (SDS-PAGE) and subsequently 
transferred to Hybond-C nitrocellulose (Amersham, Buckinghamshire, England) for 1 

20 h at 100 V using a Bio-Rad protein transfer cell (Bio-Rad, Richmond, Ca). Membranes 
were pre-incubated with 1% w/v BSA/PBS for 2 h at room temperature and then reacted 
with affinity purified sheep anti-Nef^ 5 , 2 7), diluted 1:100, overnight at room temperature. 
After three washes in 1% w/v BSA/PBS, the blots were incubated with donkey anti- 
sheep Ig conjugated to biotin (Amersham, diluted 1 :500) for 1 h at room temperature. 

25 After extensive washing as described above die membranes were incubated with 
streptavidin-conjugated horse radish peroxidase (Amersham; diluted 1:500 for 1 h at 
room temperature. All dilutions were made with 1% w/v BSA in PBS. After further 
washing the membrane was developed with phenylenediamine substrate (Dako, 
Dapopatts, Denmark). The antibody preparation used in the immunoblotting experiments 

30 was free of detectable antibodies to the immunogenic carrier protein and coupling 
reagent. 
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EXAMPLE 7 
Analysis by Polymerase Chain Reaction Amplification 
A 5' fragment defined by primers CM and Cl-2 containing the 5' LTR and part of the 
gag gene was amplified. DNA from HTV-1 C18 infected PBMC gave an amplified 
5 fragment (amplimer) of about 1.9 kb compared with 2.1kb for pHXB2 control template, 
implying a deletion of about 200bp from fflV C18. Further amplification of this 
fragment with primers defining the U3 region of the LTR (Cl-1 and LTR-3') gave 
amplimers of about 300 bp for HTV-1 CI 8 infected PBMC DNA compared with 340bp 
for C18 and D36 PBMC DNA and 484 bp for pHXB2 control. This implies the loss of 
10 approximately 140 to 180bp from the U3 region of these proviral DNAs. 

To analyse the nef-gene-3' LTR region, the nested primer pairs SK68-C1-6 and KS-2- 
LTR-3' were used in a double PCR. Amplimers of approximately 830bp were obtained 
for HTV-1 CI 8 infected PBMC DNA as well as for PBMC DNA from Donor D36 and 
15 Recipients Cl ( 8, C54 and C98 compared with approximately 1230bp for pHXB2 DNA. 
These results suggest that about 400bp of DNA have been deleted from the Donor and 
Recipient proviral DNAs. 

In comparison, amplification of the polymerase gene region by double PCR with the 
20 nested primer pairs RT5'-v3-RT3'-v2 and RT5'-v2-RT3'-vl gave a fragment 
(approximately 2.1 kb) the same size as the molecular clone pHXB2 fragment for HTV-1 
CI 8 infected PBMC DNA, suggesting that deletions from this region were unlikely. 

EXAMPLE 8 

25 Nucleotide Sequence of the nef-3 'LTR Region 

PCR amplification experiments indicated an approximately 200bp nucleotide deletion 
from both the nef gene and LTR regions of Donor D36 PBMC and Recipient CI 8 HTV-1 
proviral DNA. To further analyse these regions, the DNA sequence was determined for 
the PCR amplified nef-3 -LTR region of D36 PBMC, CI 8 isolates HIV^ and HIV StV 

30 as well as isolate C98 HIV infected PBMC proviral DNA. The 3' region was amplified 
with outer primers (SK68-C16) and inner primers (SK68-LTR 3' or KS2-C16) and 
sequenced directly using a number of internal sequencing primers based on the HTV-1 
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NL4-3 nucleotide sequence (Table 2). 

Alignment of the nucleotide sequences of the amplified 3' region of donor D36 PBMC 
and recipient C18 isolates HIVj^ and KV stv and C98 HIV (Fig 1) showed a number 
5 of nucleotide sequences changes, including deletions, relative to the nucleotide sequence 
of wild-type infectious HIV-1 (HIV-1 NL4-3). In the region of alignment, D36 PBMC 
lacked 291 nucleotides, CI 8 HIV StV differed in size by 388 nucleotides (comprising 
deletions of 397 nucleotides and an insertion of 9 nucleotides), CI 8 HIV MBC differed 
by 456 nucleotides and C98 fflV lacked 158 nucleotides compared with HIV-1 NL4-3. 
10 The overall identity with HTV-1 NL4-3 nucleotide sequence of D36 PBMC, C18 
HIV StV , HTVj^q and C98 HIV nucleotide sequences, including deletions, was 73% 
(1157/1596), 67% (1459/1592), 62% (982/1592) and 79% (1105/1399), respectively. 

The D36 PBMC sequence differed from HIV-1 NL4-3 in a number of features. A 
15 change in the wild type tat termination codon from TAG to TCG (Ser) extended the 
third tat exon (which starts at splice acceptor 10) by a further 15 amino acids to 
terminate at a conserved TAG (Fig 1). The resulting C-terminal peptide is rich in 
charged amino acids (8/15) (Fig 2a). The wild type rev termination codon has also 
changed (TAG to GAG, Glu) and the third rev exon is extended for 14 codons to 
20 terminate at a conserved TAG (Fig 2b). The encoded extra amino acids are mainly 
polar (1 1/14) and charged in nature (Fig 2b). The sequence also encodes the C-terminal 
237 amino acids of Env gp41 (Fig 3) terminating at the normal termination codon. The 
D36 PBMC Env amino acid sequence has 85% identity with the HTV NL4-3 sequence, 
increasing to 89% if similarities are included. 

25 

There are significant differences from HIV-1 NL4-3 downstream of the env (gp41) gene. 
A change in the fifth nef codon, from TGG (Trp) to TGA (Fig 1), introduces an early 
termination in the D36 PBMC nef gene. The encoded Nef protein is identical to the N- 
terminal 4 amino acids of NL4-3 Nef (Fig 4). Following the early termination there are 
30 deletions of 33, 47, 93 and 91 nucleotides and a region of low sequence homology, 
compared with HIV NL4-3, prior to the wild type we/termination codon site (HIV NL4- 
3 nts 9405-9407). As well as removing a significant part of the nef gene, these deletions 
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also bring into phase a further 6 termination codons. While the polypurine tract (plus 
strand primer binding site) and the first 38 nucleotides of the LTR U3 region are 
perfectly conserved, downstream U3 region sequences are disrupted by the 93 and 91 
nucleotide deletions and the low homology region. The resulting U3 region lacks 
5 recognition sequences for the transcription factors c-myb, USF and TCFlct as well as 
one of the suggested NF-AT sites (Gaynor et al, 1992). Downstream from the 91 
nucleotide deletion, a 59 nucleotide region of low homology contains two extra NFKB 
enhancer sites 19 nucleotides upstream of the usual site of a pair of NFKB sites, the 
upstream one of which is altered in its 5'-half in D36. Sequences further downstream 
10 are highly conserved with respect to HIV-1 NL4-3, including the position and number 
of Spl basal promoter sites, TATA box, TAR and polyadenylatipn signal sequences. 

Similar to D36 PBMC, the C18 HIV Stv and HTV^c sequences show the tat third exon 
to be extended by 15 codons. All but two codons (altered by point mutations) are 
15 identical to those of D36 PBMC (Fig 2a). The rev third exon of both CI 8 isolates is 
also extended (Fig 2b) but by only three codons, identical to the first three codons of 
the D36 PBMC rev extension. The same 237 amino acid coding region of Env gp41 is 
found in both the CI 8 HTV DNA sequences (Fig 3) and shows 85% identify, increasing 
to 88% if similarities are included, with the same region of the NL4-3 Env gp41. 

20 

It is in the nef gene and LTR regions that the major differences from wild-type HIV-1 
arise, just as in D36 PBMC. The nef gene of CI 8 HIV StV encodes 24 amino acids with 
9 of the 10 N-terminal being identical to the NL4-3 Nef protein (Fig 4). Thereafter, 
deletions of 177 and 1 1 nucleotides cause a frameshift and termination at the 25th codon 
25 (Fig 1). Downstream deletions of 120, 82 and 7 nucleotides cause further loss of wild 
type we/gene sequence and bring into phase a further three termination codons. 

The nef gene of CI 8 HTVj^ encodes only 7 amino acids with only the initiator 
methionine identical to the NL4-3 Nef protein. This loss of identity and early 
30 termination is brought about by a 250 nucleotide deletion after the fifth nucleotide of 
the nef gene. Downstream deletions of 120 and 86 nucleotides cause further loss of 
wild-type nef gene sequences. In both CI 8 isolates there is perfect conservation of the 
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polypurine tract and 29/31 nucleotides at the 5' end of the U3 region immediately before 
the 120 nucleotide deletion (Fig 1). This deletion together with the downstream 82 and 
7 nucleotide deletions in HIV StV and 86 nucleotide deletion in HIV MBC and the low 
homology region cause the loss of the 5' half of the NRT-1 site (Yamamoto et al 1992) 
5 and the downstream NFAT site. A third NFKB site is present 31 (HIV stV ) and 33 
(HlVj^jg^ nucleotides upstream of the expected pair of NFKB sites which are 
themselves separated by 13 nucleotides instead of the 4 nucleotides in HIV-1 NL4-3. 
The 5'-most Spl site sequence is slightly altered but sequences downstream including 
the other 2 Spl sites, the TATA box, TAR and polyadenylation signal sequences are 
10 identical to HTV-1 NL4-3 sequence. 

The three sequences, D36 PBMC, C18 HTV StV and C18 HTVj^ show a number of 
similarities consistent with the transmission of virus from person D36 to person CI 8 as 
well as a number of differences indicating post-transmission divergence of sequence. 

15 All three have tat open reading frames (ORFs) extended by 1 5 codons. All three have 
extended rev ORFs. The new rev termination codon in both CI 8 HIV-1 isolates, three 
codons downstream of the NL4-3 rev termination codon, has a point mutation in D36 
PBMC to make a Glu codon so that it continues for a further 11 codons (Fig 2b) to 
terminate at a conserved TGA. The partial Env gp41 amino acid sequences are more 

20 closely related to each other (86% identity or 90% including similarities) than to HTV 
NL4-3 (85% and 89%, respectively). 

The nucleotide sequence of the nef and LTR region of the HIV-1 isolate from recipient 
C98 (C98 HIV) is 90.3% identical (1264/1399) to the HIV NL4-3 sequence, ignoring 

25 deletions. Similar to the D36 PBMC and CI 8 HIV StV and fflVj^ isolates the C98 
HIV sequence shows the third exon of tat to be extended by 15 codons with all but one 
being identical to the D36 PBMC tat extension. Also, the rev gene is extended by 3 
codons, 2 of which are identical to the first 2 codons of the D36 PBMC rev extension. 
The sequence also encodes the C-terminal 223 amino acids of Env gp41 terminating at 

30 the normal termination codon. The C98 HTV Env amino acid sequence has 89% identity 
with HTV NL4-3 Env sequence, increasing to 92% of similarities are included. 
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As with the D36 PBMC and the C98 HIV isolate sequences it is the we/gene and LTR 
regions that major differences from the HIV NL4-3 sequence arise. The nef gene open 
reading frame of C98 HIV is much longer than in D36 PBMC, CI 8 HIV StV and 
HIV^jgQ, encoding 85 amino acids compared with 206 amino acids for NL4-3. Sixty 
5 eight of those 85 amino acids are identical to the N-terminal sequence of NL4-3 Nef. 
The single, small deletion (16 nucleotides) in the C98 HIV w^alone regions (Table 3) 
occurs after nef codon 82 causing a frameshift and termination after a further 3 codons 
at the start of the highly conserved polypurine tract sequence immediately before the 3'- 
LTR. The nefTLlK region has two deletions totalling 142 nucleotides. The 5'-most 

10 deletion of 42 nucleotides includes the splice acceptor 12 sequence. The NRT-1, dyad 
symmetry and myb response element sequences are all intact However, the downstream 
100 nucleotide deletion includes sequences from the 3' end of the 5'-NF-AT and all of 
the 3' NF-AT sequences as well as the USF transcription factor recognition sequence. 
The downstream low homology region of 77 nucleotides lacks the TCF-la sequence but 

15 has two additional NFKB sites 13 nucleotides apart and 26 nucleotides upstream of the 
3'-half-remmant of the normal 5'-NFKB site. Sequence downstream, including the 3'- 
NFKB site, the 3 Spl sites, TATA box TAR and polyadenylation signal sequences are 
all highly conserved. 

20 The main feature of the sequences is the series of deletions, with respect to HIV NL4-3, 
in the nef gene-3'-LTR region. These can be grouped into two regions namely the nef 
alone region, that part of the nef gene upstream of the LTR, and the ne^LTR region, 
where the nef gene and LTR U3 regions overlap. The deletions in these regions of each 
of the sequences start and end at the same or similar positions (Table 3). The deletions 

25 are larger in C18 HIV StV and C18 HIV^c sequences where totals" of 397 and 456 
nucleotides have been deleted (relative to NL4-3) compared to 291 nucleotides, from 
D36 and 158 nucleotides from C98 HIV. In the we/^alone region the two deletions in 
CI 8 HTV StV and the single deletion in CI 8 HIV^c occupy the same region as the three 
deletions in D36 PBMC. Similarly, the nefLTR region in the three deletions in the CI 8 

30 HTV StV , the two deletions in the CI 8 HIV,^ and the D36 PBMC sequences occupy 
the same region. These findings indicate that mutant virus was transmitted from D36 
to CI 8 after which further deletions and rearrangements occurred. Similarly, the 
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sequence of C98 HIV in the neflLTK region indicates two deletions occupying the same 
region as the neflLTK deletions in D36 and the CI 8 sequences. However, the size (only 
16 nucleotides) and the position of the deletion in the ne/alone region of C98 HIV are 
distinct from those of the D36 PBMC and CI 8 sequences. 

5 

The timing of transmission of virus by transfusion was that recipient CI 8 was transfused 
approximately 19 months after C98. Consistent with the relative timing of transmission 
and the sequence similarities and differences is the suggestion that at the time of 
transmission to C98, the D36 sequence had deletions in the neflLTK region but not in 

10 ne/alone region. After transmission to C98, the C98 virus developed further deletions 
and rearrangements, including the deletion in the wefalone region. The D36 virus 
evolved so that at the time of transmission to CI 8, further deletions and rearrangements 
had occurred including deletion of sequence from the ne^alone region distinct from the 
C98 HTV ne/-alone region deletion. After transmission to CI 8, further deletions and 

15 rearrangements occurred in the CI 8 virus giving rise to at least two sequences (HIV StV 
and HTV^go). 

The zie^alone deletion region may be a mutation or recombination "hotspot" as it 
includes sequences that were found to be variably duplicated in 28 out of 54 Nef protein 

20 sequences derived from 8 of 12 patients analysed in a study (Shugars et al 1993). The 
sequence between the zie^alone and the neflLTK region deletions is highly conserved 
and is important in provirus integration into the infected cell genome and interacts with 
a number of cellular proteins. It is interesting that the sequence equivalent to NL4-3 
nucleotides 9209 to 9225 is retained in D36 and C98 HTV but lost in the CI 8 HIV 

25 sequences. This includes part of a sequence of dyad symmetry (9210 to 9231) and is 
a significant part of the binding site for NRT-1 (Yamamoto et al 1991) which has been 
shown to have a negative regulatory effect on HIV-1 expression. The presence of this 
sequence in D36 and C98 HIV and its absence from the CI 8 isolates may correlate with 
the inability to isolate virus from D36 PBMC and the poor replication of C98 HIV but 

30 the ability to isolate HIV-1 from CI 8 PBMC. The deletion of sequence equivalent to 
nucleotides 9281 to 9395 of NL4-3 causes the loss of some transcription factor binding 
sites including NFAT and USF from the D36, CI 8 HTV and C98 HTV sequences. 
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A further similarity between the D36, CI 8 HIV StV , C18 HIV^c and C98 HIV 
sequences is a region of low homology to HIV-1 NL4-3 extending downstream of the 
neflLTR deleted region to the NFKB enhancer/Spl promoter site region. This low 
homology region in fact consists of incomplete duplications of part of the NFKB/Spl 
5 region (Fig 5) resulting in D36 and C98 HIV having 2 extra NFKB sites upstream of 
an altered 5' NFKB site while the CI 8 sequences have one extra NFKB site and altered 
spacing between the 5' and 3' wild type NFKB sites due to an insertion of 9 nucleotides. 

For the CI 8 and C98 HIV-1 isolates virus replication was assessed in PHA-stimulated 
10 and non-stimulated PBMCs (Figs 6 and 7). In PHA-stimulated PBMCs we also studied 
cell surface CD4 and IL-2R expression (Fig 8). In comparison with HIV-1 wild type 
SI and NSI isolates clearly both CI 8 HTV^c and C98 viruses are replication 
competent, though C98 HTV replicates more poorly than C18 HTV MBC and are of the 
NSI phenotype when syncytium formation and CD4 and IL-2R surface expression are 
IS taken into account. Additionally, and more surprisingly, these two viruses replicated 
almost as efficiently in non-PHA stimulated PBMCs when compared to a typical local 
wild type SI isolate (HIV-1 228200, Fig 7). 

When protein expression was assessed for CI 8 HTV^c and C98 HIV^q although 
20 structural proteins were identified, no typical Nef protein was seen in infected cells. 
However, analysis of cell lysates prepared from PBMC infected with CI 8 HTV^q or 
PBMC infected with C98 HTV,^ (which were subsequently stimulated by UV 
irradiation, see Valerie et al, 1988) by Western immunoblotting using two antibodies 
specific for the N-terminal region of Nef showed the presence of smaller proteins of 19 
25 kDa and 21 kDa, respectively. These proteins were not observed in mock-injected 
control PBMC lysates and were not observed when the infected-cell lysates were probed 
were probed with antibodies reactive with the C-terminus of Nef. 

Thus, although the CI 8 and C98 HTV isolates are replication competent in vitro they 
30 clearly replicate differently using different conditions for cell activation and from the 
known functions of HIV-1 Nef protein and the LTR show that the major deletion in the 
nef gene and/or the LTR is at least in part responsible for the outcome of infection, 
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implicating the importance of Nef and/or the LTR in the clinical outcome of infection 
in vivo. 

EXAMPLE 9 

5 Determination of Degree of Relatedness Between Viruses 

To determine the degree of relatedness between viruses such as between mutants or 
between mutants and a wild-type virus and to ascertain putative infected patients, the 
method of Delwart et al was employed. 



10 EXAMPLE 10 

Immune responsiveness of subjects infected by non-pathogenic HIV-1 isolate 
In this example, the donor and recipients of the cohort were tissue typed and assessed 
for basic cellular immune responses. Proliferative responses and IL-2 production to the 
mitogens ConA and PHA, to allogeneic mononuclear cells (irradiated pooled 
IS mononuclear cells from 20 random donors) and to recall antigens (e.g. influenza and 
tetanus toxoid) were within normal ranges. While at the immunogenetic level, HLA 
typing failed to identify a consistently common allele or haplotype within the group. 

The conservation of CD4+ counts observed in the cohort, the relative integrity of their 
20 immune system and the varied HLA types of the donor and recipients further supports 
the fact that the symptomless condition of the cohort members is due to a non- 
pathogenic strain of HTV-1 or a strain of low virulence. 

Accordingly, this provides a screening procedure for subjects putatively infected by a 
25 non-pathogenic HTV-1 isolate where such subjects are seropositive for HIV-1 (e.g. have 
antibodies to an HIV-1 glycoprotein) yet have normal proliferative responses and 
cytokine production to mitogens, allogeneic mononuclear cells and to recall antigens. 
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EXAMPLE 11 

Clinical Immunology of Cohort 
To establish that the donor and the recipients belonging to the cohort exhibit normal 
5 immunological profiles, members of the cohort were assayed for CD3, CD4, CD8, 
lymphocyte count, CD4/CD8 ratio and p-2-microglobulin over time since 
seroconversion. 

Parameters considered normal in non-infected individuals range as follows: 



Parameter 






CD3 


55-82% 


620-2200 (xl0 6 /L) 


CD4 


29-58% 


420-1410 (xl0 6 /L) 


CD8 


12-43% 


200-980 (xl0 6 /L) 


Lymphocyte count 


1000-3500 (xl0 6 /L) 




CD4/CD8 


0.7-3.7 




(^-microglobulin 


0.00-2.20 mg/L 





20 The results are shown in Figures 10(a)-(g) and clearly show that the immunological 
profiles of cohort members are substantially normal further highlighting the non- 
pathogenicity of the HTV-1 isolates of the present invention. Figure 10(g) shows a graph 
of the Kaplan-Meier (Ox and Oates, 1989) estimates of time to disease progression 
(AIDS or CD4 > 250). The results demonstrate that the difference is large in spite of 

25 the conservative bias, with a median time to progression of 6.2 years in the main 
database. An exact logrank test (Cytel Software Corporation, 1989, StatXact: Statistical 
Software for Exact Nonparametric Inference. Cambridge, Massachusetts.) was 
performed, demonstrating that the difference between the groups was highly statistically 
significant (logrank statistic 11.8, pO.OOOl). 

30 
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EXAMPLE 12 
Sequencing of isolate H1V-1 C18 MBC 
The genome of variant HIV-1 designated CI 8 HIV-1 MBC was amplified by the 
5 polymerase chain reaction (PCR) as 7 overlapping fragments using the sets of inner and 
outer oligonucleotide primers, designed using the programme PCRPLAN 
(IntelliGenetics), listed in Table 5 and either UlTma (Applied Biosystems) or a mixture 
of KlenTaq and Pfu (KlenTaq LA, Ab Peptides Inc) polymerases (for faithful 
amplification of long fragments). The resulting fragments were cloned into the Smal 

10 site of the plasmid vector pGEM7Zf+. Insert-containing clones representing each region 
of the full length variant HIV-1 were sequenced by a nested deletion strategy (Gou & 
Wu, 1982) and cycle sequencing with Taq polymerase and dye labelled primers 
complimentary to the T7 or SP6 sites within the cloning vector. Nucleotide sequences 
were entered and collated by ASSEMGEL and SEQIN (IntelliGenetics) and SEQED 

IS (Applied Biosystems) and translated to the encoded amino acid sequences using 
TRANSL (IntelliGenetics) programmes. Sequence alignments used NALIGN, 
CLUSTAL GntelliGenetics) and SEQED programmes. 

The full length sequence (Fig 9; SEQ ID NO:800) of isolate HIV-1 018,^ is 9207 
20 nucleotides long which is 506 nucleotides shorter than the HTV NL4-3 sequence. This 
size difference is comprised of 126 nucleotides of insertions and 632 nucleotides of 
deletions, see Table 6. The most extensive differences between the HTV-1 ClS^c 
sequence and HIV-1 NL4-3 are in the U3 region of the LTR and in the ne/gene, as 
hereinafter described. 

25 

The 5' LTR has deletions of 120 and 87 nucleotides and a region of low sequence 
homology, which is the result of an imperfect duplication of the downstream NFkB and 
Spl response sequences. These result in the loss of sequence from a number sites 
important in the regulation of transcription of HTV-1 genes, including the negative 
30 response element (NRE) and the response elements for a number of transcription factors 
including NF-AT, NRT-1, USF and TCF-la. Furthermore, the low homology region 
contains an extra NFkB and Spl sites as well as an insertion of 9 nucleotides between 
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the usual NFkB sites. Downstream of the NFkB sites the sequence of the LTR has a 
high level of homology (96.2%) with the same region of HIV NL4-3. 

The gag gene contains 3 insertions, which represent direct repetitions of adjacent 
5 sequences. The first is a perfect repeat of 15 nucleotides after the equivalent of 
nucleotide 1 134 of HIV-1 NL4-3 and adds 5 amino acids to the C-terminus region of 
pH 5 * 8 . The remaining 2 insertions are imperfect and perfect repeats of 30 and 6 
nucleotides, respectively, after the equivalent of HIV NL4-3 nucleotides 2163 and 2232, 
respectively. These encode an extra 12 amino acids in the C-terminus region of pl5 gag 
10 just downstream of the gag to pol frameshift sequences. The variation in sequence 
length of the gag gene at these two positions is unusual. The homology of the encoded 
amino acid sequence of HIV-1 018^^ and HTV NL4-3 for the gag pi 7, p24, and pl5 
proteins is 87.1%, 93.5% and 94.3%, respectively. 

15 In the pol ORF, the encoded proteins have high homology with the NL4-3 sequences 
being 95.5% overall comprising plO protease 92.9%, p66 reverse transcriptase 95.4% 
and p34 integrase 95.8%. The amino acid sequence of the p61 RT lacks the mutations 
associated with resistance to the nucleoside (AZT, ddl, ddQand non-nucleoside 
(Nevirapine) analogue drugs used in the treatment of HTV- 1 -infected persons. 

20 

The vz/gene encodes a 192 amino acid protein with 88.0% homology with that of HTV- 1 
NL4-3. The vpr gene encodes a 96 amino acid protein with 89.6% homology with that 
of HTV- 1 NL4-3. 

25 There are 2 insertions and 1 deletion of sequences in the vpu gene. The insertions of 
3 and 9 nucleotides are after the equivalent of nucleotide 6071 and 6234, respectively, 
of HIV-1 NL4-3. These add 1 amino acid after amino acid 3, and 3 amino acids after 
amino acid 59 of the encoded Vpu protein. The deletion of 12 nucleotides after the 
equivalent of HTV- 1 NL4-3 nucleotide 6261 deletes 4 amino acids from the C-terminal 

30 region of Vpu as well as from the signal peptide of the em polyprotein, which is 
encoded by an overlapping reading frame. Amino acid sequence homology of HIV-1 

C18 MBC V P U ^ NL 4 ' 3 is 85 - 2% - 
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The sequence encoding the env gpl20 has 9 insertions totalling 45 nucleotides (encoding 
15 amino acids) and the deletion of a total of 18 nucleotides (encoding 6 amino acids). 
These are listed in Table 6. All of these events (insertions and deletions) are at positions 
in the env gene. This is within the env V3 coding region, immediately upstream of the 
5 sequence encoding the so called V3 tip (or loop) amino acid sequence, Gly Pro Gly Arg. 
The V3 region sequence is that of a typical clade B subtype (North America, Europe and 
Australia) being identical to the clade B consensus sequence (based on 186 env 
sequences) at 29/35 positions. The type of amino acid at positions 1 1 and 28 of the V3 
loop region (where position 1 is the Cys at amino acid 266 of the env gpl20) is 

10 predictive of the viral non-syncytium / syncytium forming phenotype (Fouchier et al, 
1992). The HIV-1 018^^ env gene encoded amino acid sequence has Ser at position 
1 1 and He at position 28 of the V3 loop region. The lack of a positively charged amino 
acid at both positions is strongly indicative of a non-syncytium viral phenotype. The 
overall amino acid sequence homology with HTV-1 NL4-3 (ignoring deletions and 

15 insertions) is 86.1%, comprising 85.5% for the gp 120 region and 87.6% for the gp41 
region. 

Both the tat and rev second exon open reading frames (ORF) are longer than in HTV-1 
NL4-3. A change of the tat termination codon from TAG to TCG extends the tat ORF 
20 to a downstream in phase termination codon extending the encoded tat amino acid 
sequence by 15 residues, compared with the 86 amino acid long NL4-3 tat protein, to 
a total length of 101 amino acids. However, this is the usual length of the HIV-1 tat 
protein. 

25 Similarly, the normal rev termination codon is changed from TAG to GAG. This 
extends the rev ORF to an in-phase termination codon 3 codons downstream so that the 
encoded Rev protein is 1 19 amino acids long instead of the usual 116. 

As mentioned above the most extensive differences between the sequences of the isolate 
30 HIV-1 ClSj^c and HIV-1 NL4-3 are in the nef gene and the LTR region. While the 
ne/gene overlaps the 3' LTR, these differences are found in both the ne/alone and the 
nef I LTR overlap regions. The HIV ClS^c-encoded nef protein is only 24 amino 
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acids long compared with the normal length of 206 amino acids. This severe shortening 
of the nef protein is due to the deletion of 188 nucleotides (the 177 and 11 nucleotide 
deletions) from the we/^alone region which also brings into phase a termination codon, 
TAG, at the resulting 25th codon. Downstream there is further loss of potential nef gene 
5 sequences by the 120 and 87 nucleotide deletions situated in the nef I LTR overlap 
region. The resulting 24 amino acid nef protein is identical to the N-terminus of the 
HIV-1 NL4-3 nefaX 9 of the first 10 positions. Thereafter, homology is lost completely. 

Some sequences used in the generation of mature mKNAs are altered or lost in 018^^ 

1 0 The dinucleotide immediately after the splice donor site 2 (SD2) at nts 48 1 8-48 1 9 (NL4- 
3 equivalent nts 4963-4964)is changed from the conserved GT to GC. It is expected 
that this change would lead to loss of function of this site as a splice donor. Splice 
donor 2 is used in the processing of HTV-1 transcripts to some of the mRNAs that 
encode Tat, Rev and nef proteins. Similarly the splice acceptor site 7 (SA7) sequence 

15 at nts 6477-6478 (NL4-3 equivalent nts 6602-6603) is changed from the conserved AG 
dinucleotide to TC. This change is expected to lead to loss of function of this site as 
a splice acceptor. While this SA site is used in HIV-1 mRNA processing it is not a 
major site and is not used in the production of the regulatory proteins (Tat, Rev or nef) 
mRNAs. The splice donor 12 site is absent from the ClSj^ sequence (NL43 

20 equivalent nts 9161-9162) as it is within the first deletion region in the nef I LTR 
overlap region which occurs at nt 8797 and results in the loss of NL43 nucleotides 9105 
to 9224. It is significant that the SA12 site is absent from the sequence of all of the 
cohort virus isolates so far obtained as well as from the sequence of D36 PBMC, 
however, the C54 PBMC sequence does contain the SA12 site. SA12 is not used in the 

25 processing of mRNAs that encode the viral regulatory proteins. Normally SA12 is used 
in splicing in conjunction with SD1, 2, 3 and 4 and the resulting spliced RNA is 
probably not a mRNA but may have a regulatory role involving binding to cellular 
proteins (Smith et al, 1992). 

30 An interesting feature of the sequence of the HTV-1 ClS^c isolate is the deletion and 
rearrangement of sequence from the 5'-LTR U3 region and the deletion of sequence 
from the nef gene (both nef alone and nef I V LTR regions). These being the only 
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features of the sequence distinct from disease-causing HTV-1. The lack of AIDS or 
AIDS-Iike symptoms in the patient CI 8 is attributed to the effects of the loss of LTR 
sequence and / or the loss of nef coding sequences and their role in the pathogenesis of 
AIDS. 

5 

TABLE 4 

Primers used to Amplify Overlapping regions of HTV-1 CU^qq 

10 Primer 5'- Direction Primer Sequence 

Coordinate (♦/-) Length (nt) 



15 



20 



25 



30 



35 



CL 1A 


1 


♦ 


30 


CL 14 


896 




25 


CL IB 


1 


+ 


30 


CL 13 


796 




31 


CL 11 


. 682 


. + 


23 


CL 18 


3440 




30 


CL 12A 


732 


♦ 


26 


CL 17 


3330 




32 


CL 26 


3193 


+ ■ 


39 


CL 6B - 


9671 




39 


CL 27 


3251 


+ 


39 


CL 2B 


639 




37 
• 


CL 19 


5448 




30 


CL 24 


8422 




28 



TGGAAGGGCTAATTTACTCCCAAAAAAGAC 



AATCCOGGGTGGAAGGGCTAATTTACTCCC 
CCTCT74GACCGCITAATACTGACGCTCTCGC 

TCTC7XGACGCAGQACTCGGCTT 
CTGTTTTCTG CCAGTTCTAG CTCTGCTTCT 

TTTCCCG GGCGG CGACTGGTGAGT AC 
£CCrcraGACTTGCCCAATTCAATTTTCCCAC 

OOLCACCAGACAAAAAGCftlCK^ 
TtXnMAGAlTllXXaiCAaSSAl^^ 

CCAICCIGAIAAATtXJAC»SCACAAt^^ 
TGGCCCAAACATTATGTACCTCT6CATCATATGC 

AGCAGGACATAACAAGGTAGGATCTCTACA 



Underlined sequences depict added restriction enzyme site 

+ and - orientations refer to sense and antisense strands of the double stranded DNA 
sequence 
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TABLE 5 

Sequence Deletions and Insertions in HTV-1 C18 
Compared with HTV-1 NL43 



MBC* 



10 



15 



20 



25 



30 



35 



40 



Gene or Region 


Position 
C18„b C 


(nt) 
NL43 


Deletions 
(nt) 


Insertions 
(nt) 


5' -LTR U3 


29 


29 


120 


- 


5 ' -LTR U3 


85 


205 


87 


- 


5' -LTR U3 


154 


360 


- 


9 


gag pi7 


939 


1134 


- 


15 


9*g pis 


1982 


2163 


- 


30 


gag pl5 


2081 


2232 


- 


6 


ypu 


5927 


6062 


- 


3 


vpu/env 


6092 


6234 


- 


9 


vpu/env 


6128 


6261 


12 


- 


env 


6483 


6628 


- 


6 


env 


6514 


6653 


2 


_ 


env 


6524 


6665 


1 


_ 


env 


6630 


6772 




9 


env 


6646 


6778 


_ 


3 


env 


7011 


7141 


6 




env 


7140 


7276 


3 




env 


7195 


7334 




6 


env 


7266 


7399 


3 




env 


7278 


7414 




6 


env 


7290 


7420 




2 


env 


7300 


7429 




. 1 


env 


7314 


7441 


3 




env 


7463 


7593 




3 


env 


7471 


7598 




9 


nef 


8711 


8829 


177 




nef 


8723 


9018 


11 




nef / LTR 


8798 


9104 


120 




nef / LTR 


8854 


9280 


87 




LTR U3 


8923 


9435 







632 



126 
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EXAMPLE 13 

Macrophage Isolates of HIV-1 CI 8 and HIV- J C98 

5 HIV-1 has been isolated from the macrophages of patients C18 and C98. 

Patient monocytes were prepared as follows. Whole blood was spun at 2000rpm for 10 
minutes. Plasma was removed into a separate tube and the remaining cells were diluted 
1:2 in PBS" (magnesium and calcium free phosphate buffered saline). This was 
10 underlaid with 10 ml of Ficoll Isopaque and spun at 2000rpm for 20 minutes. Cells 
were collected from the interphase and washed three times with PBS". These cells were 
then seeded into a 6 well Costar tray at a concentration of 1.0 x 10 7 /ml and allowed to 
adhere for 1 hour. Any non-adherent cells were removed by aspiration. 

15 Donor HIV-1 negative macrophages for use in co-cultivation were prepared as follows. 
Peripheral blood mononuclear cells were purified from whole blood using 
Ficoll/Isopaque density gradient. These cells were seeded at a concentration of 2.0 x 
10 6 /ml in teflon. PBMC were cultured in the presence of 3/xg/ml of PHA and 1 OOOU/ml 
of M-CSF 3 days prior to co-culture. 

20 

On day of co-culture, donor PBMC were CD8 depleted. Dyna beads coated with anti- 
CD8 were used for this purpose. Dyna beads were washed once in PBS" and then 
applied to a magnet for 3 minutes. Supernatant was removed and the beads were then 
resuspended in 250/d of RF-10. Aliquots of 2.0 x 10 8 patient cells were then added to 

25 250^1 (3 beads : 1 CD8 T-cell) of Dyna beads and allowed to incubate for 30 minutes 
on ice with occasional mixing. After 30 minutes the cell suspension was placed onto 
a magnet for 3 minutes. The supernatant was then removed placed into a second tube 
containing 142 fil (1 bead : 1 CD8 T-cell) of Dyna beads. This suspension was placed 
on ice for an additional 30 minutes with occasional mixing. After 30 minutes cell 

30 suspension was placed onto a magnet for 3 minutes. Supernatant was removed and 
washed once in RF-10. 
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For co-culture, CD8 depleted PBMC were then added to patient monocytes. Half media 
changes were done every 7 days for a period of 21 days. Aliquots of 2.5 ml of medium 
was removed from these cultures and replaced with CD8 depleted donor PBMC in 
Iscoves containing 10% HuS (Human serum), 5% v/v FCS and 5% w/v IL-2 and 
5 lOOOU/ml of M-CSF. Harvested supernatants were spun at 1400 rpm for 10 minutes 
and stored as 1ml aliquots. Cell pellets were lysed in 200^1 of lysis buffer for PCR 
analysis. Infection was quantitated using a p24 EIA Kit. 

Cells were harvested from the co-cultures and used to prepare DNA as described above. 
10 The nef I 3'-LTR region of both virus isolates was amplified by PCR using the above 
described primer sets and conditions (Example 12). The resulting amplimers were 
cloned into the plasmid vector pT7T3U19 and the nucleotide sequence determined by 
the Taq cycle sequencing method with dye-labelled primers. 

IS The CI 8 macrophage sequence has 3 deletions starting and finishing at positions within 
3 nucleotides of the same deletions in CI The encoded nef protein is 3 amino 

acids long compared with 7 amino acids for CI The low homology region of the 
LTR U 3 region of CI 8 macrophage is very similar in sequence to ClS^c and 
similarly it has one extra upstream NFkB site. 

20 

On the other hand, the sequence of C98 macrophage has a number of differences from 
the C98 isolate. While it has exactly the same first deletion of 16 nucleotides just 
upstream of the polypurine tract (PPT), in the zie^alone region, and exactly the same 
second deletion (position and size) it has an extra deletion of 18 nucleotides at NL4-3 
25 equivalent nucleotides 9206 to 9223. The final deletion is in approximately the same 
position as in the C98 isolate but is 5 nucleotides longer. The encoded nef protein is 34 
amino acids long compared with 86 amino acids for the C98 isolate. The low homology 
region is very similar to the C98 isolate, having the same 2 extra upstream NFkB sites 
and completely lacking the normal 5' -NFkB site. 

30 
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EXAMPLE 14 

Construction and Use of an Infectious Molecular Clone 

5 Molecular biological techniques can be used to construct a molecular clone of, for 
example, HTV-1 (318^^ Two schemes may be used. In the first scheme genomic 
DNA, extracted from either the CD4 positive PBMC of the patient CI 8 or donor PBMC 
that have been infected with the isolate HIV-1 ClS^c , is used as the template for 
polymerase chain reaction amplification, using thermostable polymerase of high 

10 transcriptional fidelity (eg UlTma polymerase or KlenTaq/Pfu polymerase mixture), of 
long (6 to 7 kb) overlapping fragments representing the 5'- and 3 '-parts of the HIV-1 
C18 MBC proviral genome of total length 9207nts. The amplified fragments may then 
be ligated together after digestion with a restriction enzyme that cleaves at a unique site 
common to the overlapping region of the amplified fragments, for example the unique 

15 Bgl I or Nco I sites. Ligation of this full length proviral DNA into a plasmid vector 
will allow its propagation in E coli and the subsequent preparation of large (mg) 
quantities of this molecularly cloned proviral DNA. 

In the second scheme donor PBMC that have been infected with the isolate HIV-1 
20 ClS^c are used as a source of non-integrated proviral DNA which can be extracted 
from the infected cells by the Hirt extraction method (Hirt, 1967). Circular proviral 
DNA molecules may be linearised by digestion with a restriction enzyme that cleaves 
at a unique position in the genome (eg the Bgl I or Nco I sites). The resitting linearised 
molecules can be ligated into a plasmid or, more usually, a bacteriophage lambda (X) 
25 based vector (eg Charon 4a, AWES) after modification of the end to provide blunt or 
cohesive ends compatible with the vector. Transformation or transduction of E coli with 
the recombinant plasmid or bacteriophage material, respectively allows the propagation 
of the proviral DNA. Clones of E coli containing proviral DNA may be selected and 
DNA prepared. Molecular clones of retroviral genomes prepared in this way are often 
30 permuted. Rearrangement to the functional arrangement of sequences is achieved by 
restriction enzyme cleavage and religation of fragments to reconstruct the correctly 
permuted proviral genome. 
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The molecularly cloned DNA products of both schemes can be used to prepare variant 
proviral genomes that may be used as the basis of a biologically attenuated HIV-1 
vaccine strain. Similarly, they may be modified to contain extra DNA sequences in the 
we/^alone deletion region that may deliver sequences that may be of therapeutic 
5 advantage (eg antisense or ribozyme sequences). 

Infectious vims particles of HIV-1 ClS^c , or modified virus, can be produced by 
transfection of human cells (eg HeLa cells) which will produce, and release to the 
culture medium, virus particles of HIV-1 ClSj^ , or modified virus. These virus 
10 particles can be used to infect a variety of CD4 positive cells for further propagation or 
experimentation. 

EXAMPLE 15 

In vivo Primate Model 

15 Following construction of infectious molecular clones of the mutant HIV-1 strains, 
studies are then undertaken in primates to establish attenuation, immunogenicity and 
vaccine prophylactic efficacy. All studies compare mutant clones of HIV-1 with 
isogenic wild-type (WT) virus. Initial studies are performed using the macaque (M. 
nemistrina) model of HIV-1 infection. Macaque-infectious WT HIV-1 and mutant 

20 clones are compared with respect to duration of viremia, anatomic sites of replication, 
and cellular and humoral immune responses. Where the mutant HTV-1 clones induce 
an immune response in the macques infected, challenge studies with WT virus are also 
performed. Studies are performed in a limited number of chimpanzees, generally in 
parallel with the macaque studies. Relevant mutations are engineered into WT HIV-1 

25 clones previously shown to produce chronic infection in chimps, and the course of chimp 
infection with mutant clones compared with historical controls. If infection is 
established, WT challenge studies is also performed. 
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Those skilled in the art will appreciate that the invention described herein is susceptible 
to variations and modifications other than those specifically described. It is to be 
understood that the invention includes all such variations and modifications. The 
5 invention also includes all of the steps, features, compositions and compounds referred 
to or indicated in this specification, individually or collectively, and any and all 
combinations of any two or more of said steps or features. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i)(a) APPLICANT: (Other than US) 
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AUSTRALIAN RED CROSS SOCIETY (NSW DIVISION) 

(b) INVENTORS/APPLICANTS (US only): 
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(ii) TITLE OF INVENTION: NON-PATHOGENIC STRAINS OF HIV-1 

(iii) NUMBER OF SEQUENCES: 800 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: DAVIES COLLISON CAVE 

(B) STREET: 1 LITTLE COLLINS STREET 

(C) CITY: MELBOURNE 

(D) STATE: VICTORIA 

(E) COUNTRY: AUSTRALIA 

(F) ZIP: 3000 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #L0, Version #125 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: PCT INTERNATIONAL 

(B) FILING DATE: 14-FEB-1995 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: PM3864 (AU) 

(B) FILING DATE: 14-FEB-1994 

(A) APPLICATION NUMBER: PM4002 (AU) 

(B) FILING DATE: 21-FEB-1994 

(A) APPLICATION NUMBER: PN0284 (AU) 
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(C) REFERENCE/DOCKET NUMBER: EJH/EK 

(ix) TELECOMMUNICATION INFORMATION: 
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(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9709 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

GO MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 



TGGAAGGGCT AATTTGGTCC CAAAAAAGAC AAGAGATCCT TGATCTGTGG ATCTACCACA 60 

CACAAGGCTA CTTCCCTGAT TGGCAGAACT ACACACCAGG GCCAGGGATC AGATATCCAC 120 

TGACCTTTGG ATGGTGCTTC AAGTTAGTAC CAGTTGAACC AGAGCAAGTA GAAGAGGCCA 180 

AATAAGGAGA GAAGAACAGC TTGTTACACC CTATGAGCCA GCATGGGATG GAGGACCCGG 240 

AGGGAGAAGT ATTAGTGTGG AAGTTTGACA GCCTCCTAGC ATTTCGTCAC ATGGCCCGAG 300 

AGCTGCATCC GGAGTACTAC AAAGACTGCT GACATCGAGC TTTCTACAAG GGACTTTCCG 360. 

CTGGGGACTT TCCAGGGAGG TGTGGCCTGG GCGGGACTGO GGAGTGGCGA GCCCTCAGAT 420 

GCTACATATA AGCAGCTGCT TTTTGCCTGT ACTOGGTCTC TCTGGTTAGA CCAGATCTGA 480 

GCCIGGQAGC TCTCTGGCTA ACTAGGGAAC CCACTGCTTA AGCCTCAATA AAGCTTGCCT 540 

TGAGTGCTCA AAGTAGTGTG TGCCOGTCTG TTGTGTGACT CTGGTAACTA GAGATCCCTC 600 

AGACCCTTTT AGTCAGTGTG GAAAATCTCT AGCAGTGGOG CCCGAACAGG GACTTGAAAG 660 

CGAAAGTAAA GCCAGAGGAG ATCTCTCGAC GCAGGACTCG GCTTGCTGAA GCGCGCACGG 720 

CAAGAGGCGA GGGGOGGCGA CTGGTGAGTA CGCCAAAAAT TTTGACTAGC GGAGGCTAGA 780 

AGGAGAGAGA TGGGTGOGAG AGCGTCGGTA TTAAGCGGGG GAGAATTAGA TAAATGGGAA 840 

AAAATTCGGT TAAGGCCAGG GGGAAAGAAA CAATATAAAC TAAAACATAT AGTATGGGCA 900 

AGCAGGGAGC TAGAACGATT OGCAGTTAAT CCTGGCCTTT TAGAGACATC AGAAGGCIGT 960 

AGACAAATAC TGGGACAGCT ACAACCATCC CTTCAGACAG GATCAGAAGA ACTTAGATCA 1020 

TTATATAATA CAATAGCAGT CCTCTATTGT GTGCATCAAA GGATAGATGT AAAAGACACC 1080 

AAGGAAGCCT TAGATAAGAT AGAGGAAGAG CAAAACAAAA GTAAGAAAAA GGCACAGCAA 1140 

GCAGCAGCTG ACACAGGAAA CAACAGCCAG GTCAGCCAAA ATTACCCTAT AGTGCAGAAC 1200 

CTCCAGGGGC AAATGGTACA TCAGGCCATA TCACCTAGAA CTTTAAATGC ATGGGTAAAA 1260 

GTAGTAGAAG AGAAGGCTTT CAGCCCAGAA GTAATACCCA TGTTTTCAGC ATTATCAGAA 1320 

GGAGCCACCC CACAAGATTT AAATACCATG CTAAACACAG TGGGGGGACA TCAAGCAGCC 1380 

ATGCAAATGT TAAAAGAGAC CATCAATGAG GAAGCTGCAG AATGGGATAG ATTGCATCCA 1440 

GTGCATGCAG GGCCIATTGC ACCAGGCCAG ATGAGAGAAC CAAGGGGAAG TGACATAGCA 1500 

GGAACTACTA GTACCCTTCA GGAACAAATA GGATGGATGA CACATAATCC ACCTATCCCA 1560 

GTAGGAGAAA TCTATAAAAG ATGGATAATC CTGGGATTAA ATAAAATAGT AAGAATGTAT 1620 

AGCCCTACCA GCATTCTGGA CATAAGACAA GGACCAAAGG AACCCTTTAG AGACTATGTA 1680 

GACOGATTCT ATAAAACTCT AAGAGCCGAG CAAGCTTCAC AAGAGGTAAA AAATTGGATG 1740 

ACAGAAACCT TGTTGGTCCA AAATGCGAAC CCAGATTGTA AGACTATTTT AAAAGCATTG 1800 

GGACCAGGAG CGACACTAGA AGAAATGATG ACAGCATGTC AGGGAGTGGG GGGACCCGGC 1860 

CATAAAGCAA GAGTTTTGGC TGAAGCAATG AGCCAAGTAA CAAATCCAGC TACCATAATG 1920 

ATACAGAAAG GCAATTTTAG GAACCAAAGA AAGACTGTTA AGTGTTTCAA TTGTGGCAAA 1980 

GAAGGGCACA TAGCCAAAAA TTGCAGGGCC CCTAGGAAAA AGGGCTGTTG GAAATGTGGA 2040 

AAGGAAGGAC ACCAAATGAA AGATTGTACT GAGAGACAGG CTAATTTTTT AGGGAAGATC 2100 

TGGCCTTCCC ACAAGGGAAG GCCAGGGAAT TTTCTTCAGA GCAGACCAGA GCCAACAGCC 2160 

CCACCAGAAG AGAGCITCAG GTTTGGGGAA QAGACAACAA CTCCCTCTCA GAAGCAGGAG 2220 

COGATAGACA AGGAACTGTA TCCTTTAGCT TCCCTCAOAT CACTCTTTGG CAOOGACOCC ■ 2280 
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TCGTCACAAT AAAGATAGGG GGGCAATTAA AGGAAGCTCT ATTAGATACA GGAGCAGATG 2340 

ATACAGTATT AGAAGAAATG AATTTGCCAG GAAGATGGAA ACCAAAAATG ATAGGGGGAA 2400 

TTGGAGGTTT TATCAAAGTA GGACAGTATG ATCAGATACT CATAGAAATC TGCGGACATA 2460 

AAGCTATAGG TACAGTATTA GTAGGACCTA CACCTGTCAA CATAATTGGA AGAAATCTGT 2520 

TGACTCAGAT TGGCTGCACT TTAAATTTTC CCATTAGTCC TATTGAGACT GTACCAGTAA 2580 

AATTAAAGCC AGGAATGGAT GGCCCAAAAG TTAAACAATG GCCATTGACA GAAGAAAAAA 2640 

TAAAAGCATT AGTAGAAATT TGTACAGAAA TGGAAAAGGA AGGAAAAATT TCAAAAATTG 2700 

GGCCTGAAAA TCCATACAAT ACTCCAGTAT TTGCCATAAA GAAAAAAGAC AGTACTAAAT 2760 

GGAGAAAATT AGTAGATTTC AGAGAACTTA ATAAGAGAAC TCAAGATTTC TGGGAAGTTC 2820 

AATTAGGAAT ACCACATCCT GCAGGGTTAA AACAGAAAAA ATCAGTAACA GTACTGGATG 2680 

TGGGCGATGC ATATTTTTCA GTTCCCTTAG ATAAAGACTT CAGGAAGTAT ACTGCATTTA 2940 

CCATACCTAG TATAAACAAT GAGACACCAG GGATTAGATA TCAGTACAAT GTGCTTCCAC 3000 

AGGGATGGAA AGGATCACCA GCAATATTCC AGTGTAGCAT GACAAAAATC TTAGAGCCTT 3060 

TTAGAAAACA AAATCCAGAC ATAGTCATCT ATCAATACAT GGATGATTTG TATGTAGGAT 3120 

CTGACTTAGA AAXAGGGCAG CATAGAACAA AAATAGAGGA ACTGAGACAA CATCTGTTGA 3180 

GGTGGGGATT TACCACACCA GACAAAAAAC ATCAGAAAGA ACCTCCATTC CTTTGGATGG 3240 

GTTATGAACT CCATCCTGAT AAATGGACAG TACAGCCTAT AGTGCTGCCA GAAAAGGACA 3300 

GC1GGACTGT CAATGACATA CAGAAATTAG TGGGAAAATT GAATTGGGCA AGTCAGA1TT 3360 

ATGCAGGGAT TAAAGTAAGG CAATXATGTA AACTTCTTAG GGGAACCAAA GCACTAACAG 3420 

AAGTAGTACC ACTAACAGAA GAAGCAGAGC TAGAACTGGC AGAAAACAGG GAGATTCTAA 3480 

AAGAACCGGT ACATGGAGTG TATTATGACC CATCAAAAGA CTTAATAOCA GAAATACAGA 3540 

AGCAGGGGCA AGGCCAATGG ACATATCAAA TTTATCAAGA GCCATTTAAA AATCTGAAAA 3600 

CAGGAAAATA TGCAAGAATG AAGGGTGCCC ACACTAATGA TGTGAAACAA TTAACAGAGG 3660 

CAGTACAAAA AATAGCCACA GAAAGCATAG TAATATGGGG AAAGACTCCT AAATTTAAAT 3720 

TACCCATACA AAAGGAAACA TGGGAAGCAT GGTGGACAGA GTATTGGCAA GCCACCTGGA 3780 

TTCCTGAGTG GGAGTTTGTC AATACCCCTC CCTTAGTGAA GTTATGGTAC CAGTTAGAGA 3840 

AAGAACCCAT AATAGGAGCA GAAACTTTCT ATGTAGATGG GGCAGCCAAT AQGGAAACXA 3900 

AATTAGGAAA AGCAGGATAT GTAACTGACA GAGGAAGACA AAAAGTTGTC CCOCTAACGG 3960 

ACACAACAAA TCAGAAGACT GAGTTACAAG CAATTCATCT AGCTTTGCAG GATTCGGGAT 4020 

TAGAAGTAAA CATAGTGACA GACTCACAAT ATGCATTGGG AATCATTCAA GCACAACCAG 4080 

ATAAGAGTOA ATCAGAGTTA GTCAGTCAAA TAATAGAGCA GTTAATAAAA AAGGAAAAAG 4140 

TCTACCTGGC ATGGGTACCA GCACACAAAG GAATTGGAGG AAATGAACAA GTAGATGGGT 4200 

TGGTCAGTGC TGGAATCAGG AAAGTACTAT TTTTAGATGG AATAGATAAG GCCCAAGAAG 4260 

AACATOAGAA ATATCACAGT AATTGGAGAG CAATGGCXAG TGATTTTAAC CTACCACCTG 4320 

TAGTAGCAAA AGAAAXAGTA GCCAGCTGTO ATAAATGTCA GCTAAAAGGG GAAGCCATGC 4380 

ATGGACAAGT AGACTG7AGC CCAGGAATAT GGCAGCTAGA TTGTACACAT TTAGAAGGAA 4440 

AAGTTATCTT GGTAGCAGTT CATGTAGCCA GTGGATATAT AGAAGCAGAA GTAATTCCAG 4500 

CAGAGACAGG GCAAGAAACA GCATACTTCC TCTTAAAATT AGCAGGAAGA TGGCCAGTAA 4560 

AAACAGTACA TACAGACAAT GGCAGCAATT TCACCAGTAC TACAGTTAAG GCCGCCTGTT 4620 

GGTGGGCGGG GATCAAGCAG GAATTTGGCA TTCCCTACAA TCCCCAAAGT CAAGGAGTAA 4680 

TAGAATCTAT GAATAAAGAA TTAAAGAAAA TTATAGGACA GGTAAGAGAT CAGGCTGAAC 4740 

ATCTTAAGAC AGCAGTACAA ATGGCAGTAT TCATCCACAA TTTTAAAAGA AAAGGGGGGA 4800 

TTGGGGGGTA CAGTGCAGGG GAAAGAATAG TAGACATAAT AGCAACAGAC ATACAAACTA 4860 

AAGAATTACA AAAACAAATT ACAAAAATTC AAAATTTTCG GGTTTATTAC AGGGACAGCA 4920 

GAGATCCAGT TTGGAAAGGA CCAGCAAAGC TCCTCTGGAA AGGTGAAGGG GCAGTAGTAA 4980 

TACAAGATAA TAGTGACATA AAAGTAGTGC CAAGAAGAAA AGCAAAGATC ATCAGGGATT 5040 

ATGGAAAACA GATGGCAGGT GATGATTGTG TGGCAAGTAG ACAGGATGAG GATTAACACA 5100 
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TGGAAAAGAT TAGTAAAACA CCATATGTAT ATTTCAAGGA AAGCTAAGGA CTGGTTTTAT 5160 

AGACATCACT ATGAAAGTAC TAATCCAAAA ATAAGTTCAG AAGTACACAT CCCACTAGGG 5220 

GATGCTAAAT TAGTAATAAC AACATATTGG GGTCTGCATA CAGGAGAAAG AGACTGGCAT 5280 

TTGGGTCAGG GAGTCTCCAT AGAATGGAGG AAAAAGAGAT ATAGCACACA AGTAGACCCT 5340 

GACCTAGCAG ACCAACTAAT TCATCTGCAC TATTTTGATT GTTTTTCAGA ATCTGCTATA 5400 

AGAAATACCA TATTAGGACG TATAGTTAGT CCTAGGTGTG AATATCAAGC AGGACATAAC 5460 

AAGGTAGGAT CTCTACAGTA CTTGGCACTA GCAGCATTAA TAAAACCAAA ACAGATAAAG 5520 

CCACCTTTGC CTAGTGTTAG GAAACTGACA GAGGACAGAT GGAACAAGCC CCAGAAGACC 55 BO 

AAGGGCCACA GAGGGAGCCA TACAATGAAT GGACACTAGA GCTTTTAGAG GAACTTAAGA 5640 

GTGAAGCTGT TAGACATTTT CCTAGGATAT GGCTCCATAA CTXAGGACAA CATATCTATG 5700 

AAACTTACGG GGATACTTGG GCAGGAGTGG AAGCCATAAT AAGAATTCTG CAACAACTGC 5760 

TGTTTATCCA TTTCAGAATT GGGTGTCGAC ATAGCAGAAT AGGCGTTACT CGACAGAGGA 5820 

GAGCAAGAAA TGGAGCCAGT AGATCCTAGA CTAGAGCCCT GGAAGCATCC AGGAAGTCAG 5880 

CCTAAAACTG CTTGTACCAA TTGCTATTGT AAAAAGTGTT GCTTTCATTG CCAAGTTTGT 5940 

TTCATGACAA AAGCCTTAGG CATCTCCTAT GGCAGGAAGA AGCGGAGACA GCGACGAAGA 6000 

GCTCATCAGA ACAGTCAGAC TCATCAAGCT TCTCTATCAA AGCAGTAAGT AGTACATGTA 6060 

ATGCAACCTA TAATAGTAGC AATAGTAGCA TTAGTAGTAG CAATAATAAT AGCAATAGTT 6120 

GTGTGGTCCA TAGTAATCAT AGAATATAGG AAAATATTAA GACAAAGAAA AATAGACAGG 6180 

TTAATTGATA GACTAATAGA AAGAGCAQAA GACAGTGGCA ATGAGAGTGA AGGAGAAGTA 6240 

TCAGCACTTG TGGAGATGGG GGTGGAAATG GGGCACCATG CTCCTTGGGA TATTGATGAT 6300 

CTGTAGTGCT ACAGAAAAAT TGTGGGTCAC AGTCTATTAT GGGGTACCTQ TGTGGAAGGA 6360 

AGCAACCACC ACTCTATTTT GTGCATCAGA TGCTAAAGCA TATGATACAG AGGTACATAA 6420 

TGTTTGOGCC ACACATGCCT GTGTACCCAC AGACCCCAAC CCACAAGAAG TAGTATTGGT 6480 

AAATGTGACA GAAAATTTTA ACATGTGGAA AAATGACATG GTAGAACAGA TGCATGAGGA 6540 

TATAATCAGT TTATGGGATC AAAGCCTAAA GCCATGTGTA AAATTAACCC CACTCTGTGT 6600 

TAGTTTAAAG TGCACTGATT TGAAGAATGA TACTAATACC AATAGTAGTA GOGGGAGAAT 6660 

GATAATGGAG AAAGGAGAGA TAAAAAACTG CTCTTTCAAT ATCAGCACAA GCATAAGAGA 6720 

TAAGGTGCAG AAAGAATATG CATTCTTTTA TAAACTTGAT ATAGTACCAA TAGATAATAC 6780 

CAGCTATAGG TTGATAAGTT GTAACACCTC AGTCATTACA CAGGCCTGTC CAAAGGTATC 6840 

CTTTGAGCCA ATTCCCATAC ATTATTGTGC CCCGGCTGGT TTTGCGATTC TAAAATGTAA 6900 

TAATAAGACG TTCAATGGAA CAGGACCATG TACAAATGTC AGCACAGTAC AATGTACACA 6960 

TGGAATCAGG CCAGTAGTAT CAACTCAACT GCTGTTAAAT GGCAGTCTAG CAGAAGAAGA 7020 

TGTAGTAATT AGATCTGCCA ATTTCACAGA CAATGCTAAA ACCATAATAG TACAGCTGAA 7060 

CACATCTGTA GAAATTAATT GTACAAGACC CAACAACAAT ACAAGAAAAA GTATCCGTAT 7140 

CCAGAGGGGA CCAGGGAGAG CATTTGTTAC AATAGGAAAA ATAGGAAATA TGAGACAAGC 7200 

ACATTGTAAC ATTAGTAGAG CAAAATGGAA TGCCACITTA AAACAGATAG CTAGCAAATT 7260 

AAGAGAACAA TTTGGAAATA ATAAAACAAT AATCTTTAAG CAATCCTCAG GAGGGGACCC 7320 

AGAAATTGTA AOGCACAGTT TTAATTGTGG AGGGGAATTT TTCTACTGTA ATTCAACACA 7380 

ACTGTTTAAT AGTACTTGGT TTAATAGTAC TTGGAGTACT GAAGGGTCAA ATAACACTGA 7440 

AGGAAGTGAC ACAATCACAC TCCCATGCAG AATAAAACAA TTTATAAACA TGTGGCAGGA 7500 

AGTAGGAAAA GCAATGTATG CCCCTCCCAT CAGTGGACAA ATTAGATGTT CATCAAATAT 7560 

TACTGGGCTG CTATTAACAA GAGATGGTGG TAATAACAAC AATGGGTCCG AGATCTTCAG 7620 

ACCTGGAGGA GGCGATATGA GGGACAATTG GAGAAGTGAA TTATATAAAT ATAAAGTAGT 7680 

AAAAATTGAA CCATTAGGAG TAGCACCCAC CAAGGCAAAG AGAAGAGTGG TGCAGAGAGA 7740 

AAAAAGAGCA GTGGGAATAG GAGCTTTGTT CCTTGGGTTC TTGGGAGCAG CAGGAAGCAC 7800 

TATGGGCTGC AOGTCAATGA CGCTGAOGGT ACAGGCCAGA CAATXATTGT CTGATATAGT 7860 

GCAGCAGCAG AACAATTTGC TGAGGGCTAT TGAGGCGCAA CAGCATCTGT TGCAACTCAC 7920 
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AGTCTGGGGC ATCAAACAGC TCCAGGCAAG AATCCTGGCT GTGGAAAGAT ACCTAAAGGA 7980 
TCAACAGCTC CTGGGGATTT GGGGTTGCTC TGGAAAACTC ATTTGCACCA CTGCTGTGCC 8040 
TTGGAATGCT AGTTGGAGTA ATAAATCTCT GGAACAGATT TGGAATAACA TGACCTGGAT 8100 
GGAGTGGGAC AGAQAAATTA ACAATTACAC AAGCTTAATA CACTCCTTAA TTGAAGAATC 8160 
GCAAAACCAG CAAGAAAAGA ATGAACAAGA ATTATTGGAA TTAGATAAAT GGGCAAGTTT 8220 
GTGGAATTGG TTTAACATAA CAAATTGGCT GTGGTATATA AAATTATTCA TAATGATAGT 8280 
AGGAGGCTTG GTAGGTTTAA GAATAGTTTT TGCTGTACTT TCTATAGTGA ATAGAGTTAG 8340 
GCAGGGATAT TCACCATTAT CGTTTCAGAC CCACCTCCCA ATCCCGAGGG GACCCGACAG 8400 
GCCCGAAGGA ATAGAAGAAG AAGGTGGAGA GAGAGACAGA GACAGATCCA TTCGATTAGT 8460 

GAACGGATCC TTAGCACTTA TCTGGGACGA TCTGCGGAGC CTGTGCCTCT TCAGCTACCA 8520 

CCGCTTGAGA GACTTACTCT TGATTGTAAC GAGGATTGTG GAACTTCTGG GACGCAGGGG 8580 

GTGGGAAGCC CTCAAATATT GGTGGAATCT CCTACAGTAT TGGAGTCAGG AACTAAAGAA 8640 

TAGTGCTGTT AACTTGCTCA ATGCCACAGC CATAGCAGTA GCTGAGGGGA CAGATAGGGT 8700 

TATAGAAGTA TTACAAGCAG CTTATAGAGC TATTCGCCAC ATACCTAGAA GAATAAGACA 8760 

GGGCTTGGAA AGGATTTTGC TATAAGATGG GTGGCAAGTG GTCAAAAAGT AGTGTGATTG 8820 

GATGGCCTGC TGTAAGGGAA AGAATGAGAC GAGCTGAGCC AGCAGCAGAT GGGGTGGGAG 8880 

CAGTATCTCG AGACCTAGAA AAACATGGAG CAATCACAAG TAGCAATACA GCAGCTAACA 8940 

ATGCTGCTTG TGCCTGGCTA QAAGCACAAG AGGAGGAAGA GGTGGGTTTT OCAGTCACAC 9000 

CrCAGGTACC TTTAAGACCA ATGACTTACA AGGCAGCTGT AGATC1TAGC CACTTTTTAA 9060 

AAGAAAAGGG GGGACTGGAA GGGCTAATTC ACTCCCAAAG AAOACAAOAT ATCCTTGATC 9120 

TGTGGATCTA CCACACACAA GGCIACTTCC CTGATTGGCA GAACTACACA CCAGGGCCAG 9180 

GGGTCAGATA TCCACTGACC TTTGGATGGT GCTACAAGCT AGTACGAGTT GAGCCAGATA 9240 

AGGTAGAAGA GGCCAATAAA GGAGAGAACA CCAGCTTGTT ACACCCTGTG AGCCTGCATO 9300 

GAATGGATGA CCCTGAGAGA GAAGTGTTAG AGTGGAGGTT TGACAGCCGC CTAGCATTTC 9360 

ATCACGTGGC COGAGAGCTX3 CATCCGGAGT ACTTCAAGAA CTGCTGACAT CGAGCTTGCT 9420 

ACAAGGGACT TTCCGCTGGG GACTTTCCAG GGAGGOGTGG CCTGGGCGGG ACTGGGGAGT 9480 

GGCGAGCCCT CAGATGCTGC ATATAAGCAG CIWATITTU CCTGT A CTGG GTCTCTCTGG 9540 

TTAGACCAGA TCTGAGCCTG GGAGCTCTCT GGCTAACTAG GGAACCCACT GCTTAAGCCT 9600 

CAATAAAGCT TGCCTTGAGT GCTTCAAGTA GTGTGTGCCC GTCTGTTGTG TGACTCTGGT 9660 

AACTAGAGAT CCCTCAGACC CTTTTAGTCA GTGTGGAAAA TCTCTAGCA 9709 
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(2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

ATGGGTGGCA 

(2) INFORMATION FOR SEQ ID N03: 

0) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO J: 
TGGGTGGCAA 

(2) INFORMATION FOR SEQ ID NO:4: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 
GGGTGGCAAG 

(2) INFORMATION FOR SEQ ID NO:5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 
GGTGGCAAGT 

(2) INFORMATION FOR SEQ ID NO:6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 



(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: 
GTGGCAAGTG 

(2) INFORMATION FOR SEQ ID NO:7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: 

TGGCAAGTGG 

(2) INFORMATION FOR SEQ ID NO:8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8: 
GGCAAGTGGT 

(2) INFORMATION FOR SEQ ID NO:9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(H) MOLECULE TYPE: DNA 
<xi) SEQUENCE DESCRIPTION: SEQ ID NO.-9: 
GCAAGTGGTC 

(2) INFORMATION FOR SEQ ID NO:10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
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CAAGTGGTCA 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:ll: 

AAGTGGTCAA 

(2) INFORMATION FOR SEQ ID NO:12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12: 

AGTGGTCAAA 

(2) INFORMATION FOR SEQ ID NO:I3: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(u) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

GTGGTCAAAA 

(2) INFORMATION FOR SEQ ID NO:14: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

fu) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

TGGTCAAAAA 

(2) INFORMATION FOR SEQ ID NO:15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 



(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15: 
GGTCAAAAAG 

(2) INFORMATION FOR SEQ ID NO:16: 

0) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 16: 
GTCAAAAAGT 

(2) INFORMATION FOR SEQ ID NO:17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
p) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:17: 
TCAAAAAGTA 

(2) INFORMATION FOR SEQ ID NO:18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
CAAAAAGTAG 

(2) INFORMATION FOR SEQ ID NO:19: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
AAAAAGTAGT 
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(2) INFORMATION FOR SEQ ID NO:20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECUUE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ IDNO:20: 
AAAAGTAGTG 

(2) INFORMATION FOR SEQ ID N021 : 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

Cii) MOLECULE TYPE: DNA 
(x0 SEQUENCE DESCRIPTION: SEQ ID N02I: 
AAAGTAGTGT 

(2) INFORMATION FOR SEQ ID N0i2: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(0) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID N022: 
AAGTAGTGTG 

(2) INFORMATION FOR SEQ ID N023: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE* DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N023: 

AGTAGTGTGA 

(2) INFORMATION FOR SEQ ID NOS4: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ IDNO:24: 
GTAGTGTGAT 

(2) INFORMATION FOR SEQ ID NO-25: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 

TAGTGTGATT 

(2) INFORMATION FOR SEQ ID N026: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N026: 
AGTGTGATTG 

(2) INFORMATION FOR SEQ ID N027: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(x0 SEQUENCE DESCRIPTION: SEQ ID NO-27: . 
GTGTGATTGG 

(2) INFORMATION FOR SEQ ID N028: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N028: 

TGTCATTGGA 

(2) INFORMATION FOR SEQ ID NO-29: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N029: 

GTGATTGGAT 

(2) INFORMATION FOR SEQ ID NO30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(O STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO30: 

TGATTGGATG 

(2) INFORMATION FOR SEQ ID N031: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N031: 

GATTGGATGG 

(2) INFORMATION FOR SEQ ID NO:32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: 

ATTGGATGGC 

(2) INFORMATION FOR SEQ ID N033: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 
TTGGATGGCC 

(2) INFORMATION FOR SEQ ID NO:34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:34: 
TGGATGGCCT 

(2) INFORMATION FOR SEQ ID NO:35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOUECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N035: 
GGATGGCCTG 

(2) INFORMATION FOR SEQ ID NO 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID N036: 
GATGGCCTGC 

(2) INFORMATION FOR SEQ ID N037: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

fii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:37: 

ATGGCCTGCT 

(2) INFORMATION FOR SEQ ID N038: 
(0 SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:38: 
TGGCCTGCTG 



CTGCTGTAAG 



(2) INFORMATION FOR SEQ ID NO:43: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(2) INFORMATION FOR SEQ ID N039: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

fii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N039: 

GGCCTGCTGT 



(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:43: 
TGCTGTAAGG 



(2) INFORMATION FOR SEQ ID NO:44: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(2) INFORMATION FOR SEQ ID NO:40: 

(0 SEQUENCE CHARACTERISTICS: 
. : (A) LENGTH: 10 base pairs 
(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:40: 

GCCTGCTGTA 



(ii) MOLECULE TYPE: DNA 
(xO SEQUENCE DESCRIPTION: SEQ ID NO:44: 
GCTGTAAGGG 



(2) INFORMATION FOR SEQ ID NO:45: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(2) INFORMATION FOR SEQ ID NO:41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:41: 
CCTGCTGTAA 



(2) INFORMATION FOR SEQ ID NO:42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE* nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:45: 
CTGTAAGGGA 

(2) INFORMATION FOR SEQ ID NO:46: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:46: 
TGTAAGGGAA 

(2) INFORMATION FOR SEQ ID NO:47: 



00 MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:42: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 
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(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:47: 

GTAAGGGAAA 

<2) INFORMATION FOR SEQ ID NO:48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:48: 
TAAGGGAAAG 

(2) INFORMATION FOR SEQ ID NO:49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:49: 

AAGGGAAAGA 

(2) INFORMATION FOR SEQ ID NO:50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECUUE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:50: 

AGGGAAAGAA 

(2) INFORMATION FOR SEQ ID NO:51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:51: 

GGGAAAGAAT 



(2) INFORMATION FOR SEQ ID NO:52: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:52: 

GGAAAGAATG 

<2) INFORMATION FOR SEQ ID NO:53: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 

GAAAGAATGA 

(2) INFORMATION FOR SEQ ID NO:54: 

CO SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE- DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:54: 

AAAGAATGAG 

(2) INFORMATION FOR SEQ ID NO:55: 

G) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(O STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:55: 

AAGAATGAGA 

(2) INFORMATION FOR SEQ ID NO:56: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:56: 
AGAATGAGAC 1 

(2) INFORMATION FOR SEQ ID NO:57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:57: 
GAATGAGACG 
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0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:61: 
GAGACGAGCT 



(2) INFORMATION FOR SEQ ID NO:62: 

CO SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(2) INFORMATION FOR SEQ ID NO:58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:58: 
AATGAGACGA 



(2) INFORMATION FOR SEQ ID NO:59: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

09 MOLECULE TYPE: DNA 

(x0 SEQUENCE DESCRIPTION: SEQ ID NO:59: 

ATGAGACGAG 



(ii) MOLECULE TYPE: DNA 
(xO SEQUENCE DESCRIPITON: SEQ ID NO:62: 
AGACGAGCTG 

(2) INFORMATION FOR SEQ ID NO:63: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xO SEQUENCE DESCRIPTION: SEQ ID NO:63: 

GACGAGCTGA 



(2) INFORMATION FOR SEQ ID NO:64: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(2) INFORMATION FOR SEQ ID NO:60: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:60: 

TGAGACGAGC 



00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:64: 
ACGAGCTOAG 



(2) INFORMATION FOR SEQ ID NO:65: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(2) INFORMATION FOR SEQ ID NO:61: 



(ii) MOLECULE TYPE: DNA 



WO 95/21912 



PCT/AU95/00063 



87- 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:65: 
CGAGCTOAGC 

(2) INFORMATION FOR SEQ ID NO:66: 

0) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
CD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:66: 

GAGCTGAGCC 

(2) INFORMATION FOR SEQ ID NO:67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:67: 

AGCTGAGCCA 

(2) INFORMATION FOR SEQ ID NO:68: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:68: 
GCTGAGCCAG 

(2) INFORMATION FOR SEQ ID NO:69: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:69: 

CTGAGCCAGC 

(2) INFORMATION FOR SEQ ID NO:70: 

(I) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ 1DNO:70: 
TGAGCCAGCA 

(2) INFORMATION FOR SEQ ID NO:71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:71: 
GAGCCAGCAG 

(2) INFORMATION FOR SEQ ID NO:72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:72: 
AGCCAGCAGC 

(2) INFORMATION FOR SEQ ID NO:73: 

0) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDNO:73: 

GCCAGCAGCA 

(2) INFORMATION FOR SEQ ID NO:74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:74: 
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CCAGCAGCAG 

(2) INFORMATION FOR SEQ ID NO:75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:75: 
CAGCAGCAGA 

(2) INFORMATION FOR SEQ ID NO:76: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

fii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO;76: 

AGCAGCAGAT 

(2) INFORMATION FOR SEQ ID N0.77: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:77: 

GCAGCAGATG 

(2) INFORMATION FOR SEQ ID NO:78: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(U) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:78: 
CAGCAGATGG 

(2) INFORMATION FOR SEQ ID NO:79: 

(i) SEQUENCE CHARACTERISTICS: * 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 



(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:79: 
AGCAGATGGG 

(2) INFORMATION FOR SEQ ID NO:80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(O STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:80: 
GCAGATGGGG 

(2) INFORMATION FOR SEQ ID NO;81 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: sbglc 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8I: 
CAGATGGGGT 

(2) INFORMATION FOR SEQ ID NO:82: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:82: 
AGATGGGGTG 

(2) INFORMATION FOR SEQ ID NO:83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDNO:83: 
GATGGGGTGG 
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(2) INFORMATION FOR SEQ ID NO:84: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: lObascpaiis 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(H) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:84: 
ATGGGGTGGG 

(2) INFORMATION FOR SEQ ID NO:85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(il) MOLECULE TYPE DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:85: 

TGGGGTGGGA 

(2) INFORMATION FOR SEQ ID NO:86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:86: 
GGGGTGGGAG 

(2) INFORMATION FOR SEQ ID NO:87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:87: 

GGGTGGGAGC 

(2) INFORMATION FOR SEQ ID NO:88: 

(!) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:88: 
GGTGGGAGCA 

(2) INFORMATION FOR SEQ ID NO:89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE* DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:89: 
GTGGGAGCAG 

(2) INFORMATION FOR SEQ ID NO:90: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:90: 
TGGGAGCAGT 

(2) INFORMATION FOR SEQ ID NO:91: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO.-91: 
GGGAGCAGTA 

(2) INFORMATION FOR SEQ ID NO:92: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:92: 
GGAGCAGTAT 

(2) INFORMATION FOR SEQ ID NO:93: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(H) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:93: 
GAGCAGTATC 

(2) INFORMATION FOR SEQ ID NO:94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

Oi) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0.94: 

AGCAGTATCT 

(2) INFORMATION FOR SEQ ID NO:95: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO.-95: 

GCAGTATCTC 

(2) INFORMATION FOR SEQ ID NOS6: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

0Q MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO.-96: 

CAGTATCTCG 

(2) INFORMATION FOR SEQ ID NO:97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:97: 
AGTATCTCGA 

(2) INFORMATION FOR SEQ ID NO:98: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDNO:98: 
GTATCTCGAG 

(2) INFORMATION FOR SEQ ID NO:99: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOUECULE TYPE: DNA * 

(xi) SEQUENCE DESCRIPTION: SEQ ID NOS9: 
TATCTCGAGA 

(2) INFORMATION FOR SEQ ID NO:10C: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS : single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

()d) SEQUENCE DESCRIPTION: SEQ ID NO:I00: 

ATCTCGAGAC 

(2) INFORMATION FOR SEQ ID NO: 101: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECUIJE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:101; 

TCTCGAGACC 

(2) INFORMATION FOR SEQ ID NO:102: 
(0 SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 10 base pairs 

(B) TYPE nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO;I02: 

CTCGAGACCT 

(2) INFORMATION FOR SEQ ID NO:103: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 
TCGAGACCTA 

(2) INFORMATION FOR SEQ ED NO:104: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(x0 SEQUENCE DESCRIPTION: SEQ ID NO:I04: 

CGAGACCTAG 

(2) INFORMATION FOR SEQ ID NO:105: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

C») MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:I05: 

GAGACCTAGA 

(2) INFORMATION FOR SEQ ID NO:106: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

09 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:106: 
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AGACCTAGAA 

(2) INFORMATION FOR SEQ ID NO:107: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(O STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:I07: 
GACCTAGAAA 

(2) INFORMATION FOR SEQ ID NO:108: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:I08: 
ACCTAGAAAA 

(2) INFORMATION FOR SEQ ID NO:109: - 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 

CCTAGAAAAA 

(2) INFORMATION FOR SEQ ID NO:l 1 0: 

(0 SEQUENCE CHARACTERISTICS: 

(A) - LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(O STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xD SEQUENCE DESCRIPTION: SEQ ID NO:110: 

CTAGAAAAAC 

(2) INFORMATION FOR SEQ ID NO:lI 1: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 
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(Q STRANDEDNES S : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1 11: 

TAGAAAAACA 

(2) INFORMATION FOR SEQ ID NO:l 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic acid 

(Q STRANDEDNES S : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l 12: 

AGAAAAACAT 

(2) INFORMATION FOR SEQ ID NO:113: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1 13: 

GAAAAACATG 

(2) INFORMATION FOR SEQ ID NO: 114; 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:114: 

AAAAACATGG 

(2) INFORMATION FOR SEQ ID NO:l 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDNO:II5: 

AAAACATGGA 



(2) INFORMATION FOR SEQ ID NO: 1 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:! 16: 
AAACATGGAG 

(2) INFORMATION FOR SEQ ID NO:l 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic add 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID Nail7: 
AACATGGAGC 

(2) INFORMATION FOR SEQ ID NO:l 18: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:ll8: 

ACATGGAGCA 

(2) INFORMATION FOR SEQ ID NO:1 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119: 
CATGGAGCAA 

(2) INFORMATION FOR SEQ ID NO:I20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 
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00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120: 
ATGGAGCAAT 

(2) INFORMATION FOR SEQ ID NO:12i: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:I21: 

TGGAGCAATC 

(2) INFORMATION FOR SEQ ID NO:I22: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(U) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122; 
GGAGCAATCA 

(2) INFORMATION FOR SEQ ID NO:123: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:123: 
GAGCAATCAC 

(2) INFORMATION FOR SEQ ID NO:124: 

(0 SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:124: 
AGCAATCACA 

(2) INFORMATION FOR SEQ ID NO: 125: 



0) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:!25: 
GCAATCACAA 

(2) INFORMATION FOR SEQ ID NO:126: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 126: 
CAATCACAAG 

(2) INFORMATION FOR SEQ ID NO: 127: 

fi) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:127: 
AATCACAAGT 

(2) INFORMATION FOR SEQ ID NO: 128: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(O STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:128: 

ATCACAAGTA 

(2) INFORMATION FOR SEQ ID NO:129: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:129: 
TCACAAGTAG 

(2) INFORMATION FOR SEQ ID NO: 1 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 130: 

CACAAGTAGC 

(2) INFORMATION FOR SEQ ID NO:I31: 

0) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:131: 

ACAAGTAGCA 

(2) INFORMATION FOR SEQ ID NO:132: 

CO SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:132: 
CAAGTAGCAA 

(2) INFORMATION FOR SEQ ID NO:133: 

© SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:133: 

AAGTAGCAAT 

(2) INFORMATION FOR SEQ ID NO:134: 

(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134: 
AGTAGCAATA 

(2) INFORMATION FOR SEQ ID NO:135: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:135: 
GTAGCAATAC 

(2) INFORMATION FOR SEQ ID NO:136: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(x0 SEQUENCE DESCRIPTION: SEQ ID NO:136: 
TAGCAATACA 

(2) INFORMATION FOR SEQ ID NO:137: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137: 
AGCAATACAG 

(2) INFORMATION FOR SEQ ID NO:I38: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:138: 
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GCAATACAGC 

(2) INFORMATION FOR SEQ ID NO: 139: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:I39: 

CAATACAGCA 

(2) INFORMATION FOR SEQ ID NO:I40: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE* nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(il) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:I40: 

AATACAGCAG 

(2) INFORMATION FOR SEQ ID NO: 141: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:I41: 
ATACAGCAGC 

(2) INFORMATION FOR SEQ ID NO: 142: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TVPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1 42: 

TACAGCAGCT 

(2) INFORMATION FOR SEQ ID NO:143: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 



(Q STRANDEDNESS: single 
(D) TOPOLOGY: iinear 

(ii) MOLECULE TVPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:143: 

ACAGCAGCTA 

(2) INFORMATION FOR SEQ ID NO: 144: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:144: 
CAGCAGCTAA 

(2) INFORMATION FOR SEQ ID NO:145: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(U) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 145: 
AGCAGCTAAC 

(2) INFORMATION FOR SEQ ID NO:146: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ1DN0:146: 
GCAGCTAACA 

(2) INFORMATION FOR SEQ ID NO:147: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear . 

(u) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:147: 

CAGCTAACAA 
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(2) INFORMATION FOR SEQ ID NO:148: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:148: 

AGCTAACAAT 

(2) INFORMATION FOR SEQ ID NO:149: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:149: 

GCTAACAATG 

(2) INFORMATION FOR SEQ ID NO:150: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:150: 
CTAACAATGC 

(2) INFORMATION FOR SEQ ID NO:15I: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:151: 

TAACAATGCT 

(2) INFORMATION FOR SEQ ED NO:152: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:152: 
AACAATGCTG 

(2) INFORMATION FOR SEQ ID NO:153: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDNO:153: 
ACAATGCTGC 

(2) INFORMATION FOR SEQ ID N0.154: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:154: 

CAATGCTGCT 

(2) INFORMATION FOR SEQ ID NO: 155: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:155: 

AATGCTGCTT 

(2) INFORMATION FOR SEQ ID NO:156: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE- nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:156: 
ATGCTGCITG 

(2) INFORMATION FOR SEQ ID NO:157: 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucJcic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ IDNO:157: 
TGCTGCTTGT 

(2) INFORMATION FOR SEQ ID NO:158: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:158: 
GCTGCTTGTG 

(2) INFORMATION FOR SEQ ID NO: 159: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:159: 
CTGCTTGTGC 

(2) INFORMATION FOR SEQ ED NO; 160: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:160: 

TGCTTGTGCC 

(2) INFORMATION FOR SEQ ID NO:I61: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:161: 
GCTTGTGCCT 

(2) INFORMATION FOR SEQ ID NO: 1 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:162: 
CrTGTGCCTG 

(2) INFORMATION FOR SEQ ID NO:I63: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:I63: 

TTGTGCCTGG 

(2) INFORMATION FOR SEQ ID NO:164: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xQ SEQUENCE DESCRIPTION: SEQ ID NO: 164: 

TGTGCCTGGC 

(2) INFORMATION FOR SEQ ID NO:165: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

. (xi) SEQUENCE DESCRIPTION: SEQ ID NO:165: 

GTGCCTGGCT 

(2) INFORMATION FOR SEQ ID NO: 166: 
0) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS : single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:166: 

TGCCTGGCTA 

(2) INFORMATION FOR SEQ ID NO:167: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

fii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:167: 

GCCTGGCTAG 

(2) INFORMATION FOR SEQ ID NO:16S: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

0i) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:168: 

CCTGGCTAGA 

(2) INFORMATION FOR SEQ ID NO:169: 

CO SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:169: 
CTGGCTAGAA 

(2) INFORMATION FOR SEQ ID NO:170: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:170: 
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TGGCTAGAAG 

(2) INFORMATION FOR SEQ ID NO:!71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS : single 
(D)*TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(x0 SEQUENCE DESCRIPTION: SEQ ID NO:l71: 

GGCTAGAAGC 

(2) INFORMATION FOR SEQ ID NO:172: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:172: 
GCTAGAAGCA 

(2) INFORMATION FOR SEQ ID NO:173: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(U) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:173: 
CTAGAAGCAC 

(2) INFORMATION FOR SEQ ID NO:174: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDNO:174: 

TAGAAGCACA 

(2) INFORMATION FOR SEQ ID NO:175: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 
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(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:175: 

AGAAGCACAA 

(2) INFORMATION FOR SEQ ID NO:176: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:176: 

GAAGCACAAG 

(2) INFORMATION FOR SEQ ID NO:177: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:I77: 

AAGCACAAGA 

(2) INFORMATION FOR SEQ ID NO: 178: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:178: 

AGCACAAGAG 

(2) INFORMATION FOR SEQ ID NO:179: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 179: 

GCACAAGAGG 



(2) INFORMATION FOR SEQ ID NO:180: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 180: 
CACAAGAGGA 

(2) INFORMATION FOR SEQ ID NO:l81: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOIXCULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:181: 

ACAAGAGGAG 

(2) INFORMATION FOR SEQ ID NO:l 82: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

CO MOLECULE TYPE: DNA . 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:182: 

CAAGAGGAGG 

(2) INFORMATION FOR SEQ ID NO:183: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:183: 
AAGAGGAGGA 

(2) INFORMATION FOR SEQ ID NO:184: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1 84: 
AGAGOAGGAA 

(2) INFORMATION FOR SEQ ID NO: 185: 

CO SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 185: 

GAGGAGGAAG 

(2) INFORMATION FOR SEQ ID NO:I86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:I86: 
AGGAGGAAGA 

(2) INFORMATION FOR SEQ ID NO:187: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 187: 

GGAGGAAGAG 

(2) INFORMATION FOR SEQ ID NO: 188: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xi*) SEQUENCE DESCRIPTION: SEQ ID NO:188: 
GAGGAAGAGG 

(2) INFORMATION FOR SEQ ID N0.189: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1 89: 
AGGAAGAGGT 

(2) INFORMATION FOR SEQ ID NO: 190: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:190: 

GGAAGAGGTG 

(2) INFORMATION FOR SEQ ID NO:191: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDNO:191: 

GAAGAGGTGG 

(2) INFORMATION FOR SEQ ED NO:192: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(x0 SEQUENCE DESCRIPTION: SEQ ID NO:192: 

AAGAGGTGGG 

(2) INFORMATION FOR SEQ ID NO:193: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(9 STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:193: 
AGAGGTGGGT 

(2) INFORMATION FOR SEQ ID NO: 194: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ IDNO:194: 
GAGGTGGGTT 

(2) INFORMATION FOR SEQ ID NO:195: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

C») MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 195: 
AGGTGGGTTT 

(2) INFORMATION FOR SEQ ID NO: 196: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(x0 SEQUENCE DESCRIPTION: SEQ ID NO:I96: 

GGTGGGTTTT 

(2) INFORMATION FOR SEQ ID NO:197: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 197: 
GTGGorniC 

(2) INFORMATION FOR SEQ ID NO:198: 

0) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 198: 
TGGGTnTCC 

(2) INFORMATION FOR SEQ ID NO:199: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(O STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xO SEQUENCE DESCRIPTION: SEQ ID NO: 199: 

GGGTI TICCA 

(2) INFORMATION FOR SEQ ID NO2Q0: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xO SEQUENCE DESCRIPTION: SEQ ID NO200: 

GGTTTTCCAG 

(2) INFORMATION FOR SEQ ID NO201: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xQ SEQUENCE DESCRIPTION: SEQ ID N0201: 

GTTTTCCAGT 

(2) INFORMATION FOR SEQ ID NO:202: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi") SEQUENCE DESCRIPTION: SEQ ID NO202: 
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TTTTCCAGTC 

(2) INFORMATION FOR SEQ ID NO303: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO203: 

TTTCCAGTCA 

(2) INFORMATION FOR SEQ ID NO204: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO204: 

TTCCAGTCAC 

(2) INFORMATION FOR SEQ ID NO:205: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO205: 

TCCAGTCACA 

(2) INFORMATION FOR SEQ ID NO206: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:206: 

CCAGTCACAC 

(2) INFORMATION FOR SEQ ID NO207: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 



(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:207: 

CAGTCACACC 

(2) INFORMATION FOR SEQ ID NO:208: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

09 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO208: 
AGTCACACCT 

(2) INFORMATION FOR SEQ ID NO209: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:209: 
GTCACACCTC 

(2) INFORMATION FOR SEQ ID NO210: 

(0 SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

0i) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO210: 

TCACACCTCA 

(2) INFORMATION FOR SEQ ID N02U: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS : single 
(D) TOPOLOGY: linear 

* 00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N021 1: 

CACACCTCAG 
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(2) INFORMATION FOR SEQ ID NO:2I2: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID N0212: 
ACACCTCAGG 

(2) INFORMATION FOR SEQ ID N0213: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECIM TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0213: 

CACCTCAGGT 

(2) INFORMATION FOR SEQ ID N0214: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0214: 

ACCTCAGGTA 

(2) INFORMATION FOR SEQ ID NO:215: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0215: 

CCTCAGGTAC 

(2) INFORMATION FOR SEQ ID NO:216: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 



fii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:216: 
CTCAGGTACC 

(2) INFORMATION FOR SEQ ID N0217: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0217: 
TCAGGTACCT 

(2) INFORMATION FOR SEQ ID N0218: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N021 8: 
CAGGTACCTT 

(2) INFORMATION FOR SEQ ID N0219: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

0i) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ IDNO:219: 
AGGTACCTTT 

(2) INFORMATION FOR SEQ ID NO:220: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic arid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:220: 
GGTACCTTTA 

(2) INFORMATION FOR SEQ ID NO-^21: 



WO 95/21912 



PCT/AU95/00063 



104 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ IDN0221: 
GTACCTTTAA 

(2) INFORMATION FOR SEQ ID NO:222: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:222: 

TACCTTTAAG 

(2) INFORMATION FOR SEQ ID NO:223: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:223: 

ACCTITAAGA 

(2) INFORMATION FOR SEQ ID NO:224: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:224: 

CCTTTAAGAC 

(2) INFORMATION FOR SEQ ID NO:225: 

CO SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:225: 
CTTTAAGACC 

(2) INFORMATION FOR SEQ ID NO:226: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDN0526: 
TTTAAGACCA 

(2) INFORMATION FOR SEQ ID NO:227: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:227: 

TTAAGACCAA 

(2) INFORMATION FOR SEQ ID NO:228: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:228: 

TAAGACCAAT 

(2) INFORMATION FOR SEQ ID N0229: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO;229: 

AAGACCAATG 

(2) INFORMATION FOR SEQ ID NO:230: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:230: 
AGACCAATGA 

(2) INFORMATION FOR SEQ ID NO:231: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:231: 
GACCAATGAC 

(2) INFORMATION FOR SEQ IDNO:232: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:232: 

ACCAATGACT 

(2) INFORMATION FOR SEQ ID NO:233: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(U) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:233: 

CCAATGACTT 

(2) INFORMATION FOR SEQ ID NO:234: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:234: 
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CAATGACTTA 

(2) INFORMATION FOR SEQ ID N0235: 

0) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:235: 
AATGACTTAC 

(2) INFORMATION FOR SEQ ID NO:236: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE* nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0236: 

ATGACTTACA 

(2) INFORMATION FOR SEQ ID NO*237: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO-237: 

TGACTTACAA 

(2) INFORMATION FOR SEQ ID NO:238: 

(0 SEQUENCE CHARACTERISTICS: . 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(x0 SEQUENCE DESCRIPTION: SEQ ID NO:238: 

GACTTACAAG 

(2) INFORMATION FOR SEQ ID NO-239: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

Ql) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:239: 
ACTTACAAGG 

(2) INFORMATION FOR SEQ ID NO240: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(9 STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:240: 

CTTACAAGGC 

(2) INFORMATION FOR SEQ ID N0241: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24I: 

TTACAAGGCA 

(2) INFORMATION FOR SEQ ID N0242: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0.242: 

TACAAGGCAG 

(2) INFORMATION FOR SEQ ID N0243: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:243: 

ACAAGGCAGC 



(2) INFORMATION FOR SEQ ID N0244: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:244: 
CAAGGCAGCT 

(2) INFORMATION FOR SEQ ID N0245: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xQ SEQUENCE DESCRIPTION: SEQ ID NO:245: 
AAGGCAGCTG 

(2) INFORMATION FOR SEQ ID N0246: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(x0 SEQUENCE DESCRIPTION: SEQ ID N0246: 

AGGCAGCTGT 

(2) INFORMATION FOR SEQ ID N0247: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:247: 

GGCAGCTGTA 

(2) INFORMATION FOR SEQ ID NO:248: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS : single 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID N0248: 
GCAGCTGTAG 

(2) INFORMATION FOR SEQ ID N0249: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:249: 

CAGCTGTAGA 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(O STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0253: 
TGTAGATCTT 



(2) INFORMATION FOR SEQ ID NO:254: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(2) INFORMATION FOR SEQ ID NO250: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic add 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

fii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO250: 

AGCTGTAGAT 



00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID N0254: 
GTAGATCTTA 



(2) INFORMATION FOR SEQ ID N0255: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(2) INFORMATION FOR SEQ ID NO:251: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0251: 

GCTGTAGATC 



(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:255: 
TAGATCTTAG 



(2) INFORMATION FOR SEQ ID N0256: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(2) INFORMATION FOR SEQ ED NO:252: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0252: 

CTGTAGATCT 



00 MOLECULE TYPE: DNA 
(x0 SEQUENCE DESCRIPTION: SEQ ID N0256: 
AGATCTTAGC 



(2) INFORMATION FOR SEQ ID N0257: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(2) INFORMATION FOR SEQ ID N0253: 



00 MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:257: 
GATCTTAGCC 

(2) INFORMATION FOR SEQ ID N0258: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

09 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0258: 

ATCTTAGCCA 

(2) INFORMATION FOR SEQ ID N0:259: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(O STRANDEDNESS: single 
P) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0259: 

TCTTAGCCAC 

(2) INFORMATION FOR SEQ ID NO:260: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO260: 
CTTAGCCACT 

(2) INFORMATION FOR SEQ ID N0261: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDN026I: 

TTAGCCACTT 

(2) INFORMATION FOR SEQ ID N0262: 

(i) SEQUENCE CHARACTERISTICS : 



(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:262: 
TAGCCACTTT 

(2) INFORMATION FOR SEQ ID N0263: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(O STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi*) SEQUENCE DESCRIPTION: SEQ ID N0263: 

AGCCAC'l I'M 

(2) INFORMATION FOR SEQ ID N0264: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(x0 SEQUENCE DESCRIPTION: SEQ ID N0264: 
GCCACTTTTT 

(2) INFORMATION FOR SEQ ID N0265: 

CO SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(u) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ IDN0265: 
CCACTTTTTA 

(2) INFORMATION FOR SEQ ID N0^66: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0266: 
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CACTTTTTAA 

(2) INFORMATION FOR SEQ ID N0267: 

0) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

Oi) MOLECULE WE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0267: 

AC1 1 I I I AAA 

(2) INFORMATION FOR SEQ ID NO:268: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

'. (u) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION; SEQ ID N0268: 

CTTTTTAAAA 

(2) INFORMATION FOR SEQ ID NO:269: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(U) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0269: 

TTTTTAAAAG 

(2) INFORMATION FOR SEQ ID NO270: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO270: 

TTTTAAAAGA 

(2) INFORMATION FOR SEQ ID N0271: 

CO SEQUENCE CHARACTERISTICS: 
• (A) LENGTH: 10 base pairs 
(B) TYPE: nucleic acid 



(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID N027I: 
TTTAAAAGAA 

(2) INFORMATION FOR SEQ ID NO:272: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0272: 
TTAAAAGAAA 

(2) INFORMATION FOR SEQ ID N0273: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTOR- SEQ ID NO:273: 
TAAAAGAAAA 

(2) INFORMATION FOR SEQ ID N0274: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0274: 
AAAAGAAAAG 

(2) INFORMATION FOR SEQ ID NO:275: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0275: 
AAAGAAAAGG 
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(2) INFORMATION FOR SEQ ID NO:276: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID N0276: 
AAGAAAAGGG 

(2) INFORMATION FOR SEQ ID NO:277: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(u) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:277: 

AGAAAAGGGG 

(2) INFORMATION FOR SEQ ID N0278: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NOS78: 

GAAAAGGGGG 

(2) INFORMATION FOR SEQ ID N0279; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0279: 

AAAAGGGGGG 

(2) INFORMATION FOR SEQ ID NO380: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO380: 
AAAGGGGGGA 

(2) INFORMATION FOR SEQ ID NO:28i: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(O STRANDEDNESS: single 
(D) TOPOLOGY: linear 

0i) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:281: 

AAGGGGGGAC 

(2) INFORMATION FOR SEQ ID NO:282: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0282: 
AGGGGGGACT 

(2) INFORMATION FOR SEQ ID NO:283: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:283: . 
GGGGGGACTG 

(2) INFORMATION FOR SEQ ID NO:284: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(iQ MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0284: 

GGGGGACTGG 

(2) INFORMATION FOR SEQ ID N0285: 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0285: 

GGGGACTGGA 

(2) INFORMATION FOR SEQ ID N0286: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:286: 
GGGACFGGAA 

(2) INFORMATION FOR SEQ ID N0287: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0287: 

GGACTGGAAG 

(2) INFORMATION FOR SEQ ID NO:288: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(u) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID N0088: 
GACTGGAAGG 

(2) INFORMATION FOR SEQ ID N0:289: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0289: 
ACTGGAAGGG 

(2) INFORMATION FOR SEQ ID NO:290; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

GO MOLECULE TYPE: DNA 

(xl) SEQUENCE DESCRIPTION: SEQ ID NO290: 

CTGGAAGGGC 

(2) INFORMATION FOR SEQ ID NO:291: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NOS91: 
TGGAAGGGCT 

(2) INFORMATION FOR SEQ ID N0292: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDNO:292: 
GGAAGGGCTA 

(2) INFORMATION FOR SEQ ID NO:293: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

C") MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:293: 

GAAGGGCTAA 

(2) INFORMATION FOR SEQ ID N0294: 
(0 SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:294: 

AAGGGCTAAT 

(2) INFORMATION FOR SEQ ID NO:295: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(i0 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:295: 

AGGGCTAATT 

(2) INFORMATION FOR SEQ ID N0296: 

© SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xQ SEQUENCE DESCRIPTION: SEQ ID NO:296: 

GGGCTAATTC 

(2) INFORMATION FOR SEQ ID N0297: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xO SEQUENCE DESCRIPTION: SEQ ID N0297: 

GGCTAATTCA 

(2) INFORMATION FOR SEQ ID N0298: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0298: 
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GCTAATTCAC 

(2) INFORMATION FOR SEQ ID N0299: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0299: 

CTAATTCACT 

(2) INFORMATION FOR SEQ ED NO:300: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(U) MOLECULE TYPE: DNA 

(x0 SEQUENCE DESCRIPTION: SEQ ID NO 300: 

TAATTCACTC 

(2) INFORMATION FOR SEQ ID NO301: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(u) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO JO 1: 

AATTCACTCC 

(2) INFORMATION FOR SEQ ID NO:302: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TVPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:302: 
ATTCACTCCC 

(2) INFORMATION FOR SEQ ID NO:303: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TVPE: nucleic acid 
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(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO303: 

TTCACTCCCA 

(2) INFORMATION FOR SEQ ID NO:304: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(u) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:304: 
TCACTCCCAA 

(2) INFORMATION FOR SEQ ID NO305: 

CO SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:305: 

CACTCCCAAA 

(2) INFORMATION FOR SEQ ID NO306: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE T/PE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDNO306: 

ACTCCCAAAG 

(2) INFORMATION FOR SEQ ID NO307: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:307: 

CTCCCAAAGA 



(2) INFORMATION FOR SEQ ID NO:308: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:308: 
TCCCAAAGAA 

(2) INFORMATION FOR SEQ ID NO 309: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ IDNO309: 
CCCAAAGAAG 

(2) INFORMATION FOR SEQ ID NO310: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS:. single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO J 10: 
CCAAAGAAGA 

(2) INFORMATION FOR SEQ ID N031 1 : 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N031 1: 

CAAAGAAGAC 

(2) INFORMATION FOR SEQ ID NO 3 12: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 



WO 95/21912 



PCT/AU95/00063 



114 



00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:312: 
AAAGAAGACA 

(2) INFORMATION FOR SEQ ID NO:3I3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDN0313: 

AAGAAGACAA 

(2) INFORMATION FOR SEQ ID N03I4: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO 3 14: 

AGAAGACAAG 

(2) INFORMATION FOR SEQ ID N0315: 

CD SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0315: 

GAAGACAAGA 

(2) INFORMATION FOR SEQ ID N0316: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xO SEQUENCE DESCRIPTION: SEQ ID N0316: 
AAGACAAGAT 

(2) INFORMATION FOR SEQ ID N0317: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 
•(B) TYPE: nucleic acid 
(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NOJ17: 
AGACAAGATA 

(2) INFORMATION FOR SEQ ID NOJ18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDNO:318: 

GACAAGATAT 

(2) INFORMATION FOR SEQ ID NO:319: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0319: 

ACAAGATATC 

(2) INFORMATION FOR SEQ ID NO320: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xO SEQUENCE DESCRIPTION: SEQ ID NO320: 

CAAGATATCC 

(2) INFORMATION FOR SEQ ID NOJ21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE* nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID N0321: 
AAGATATCCT 

(2) INFORMATION FOR SEQ ID NO:322: 

CO SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:322: 
AGATATCCTT 

(2) INFORMATION FOR SEQ ID N0323: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE- nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOUECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0323: 

GATATCCTTG 

(2) INFORMATION FOR SEQ ID N0324: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NOS24: 

ATATCCTTGA 

(2) INFORMATION FOR SEQ ID NOS2S: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:325: 

TATCCTTGAT 

(2) INFORMATION FOR SEQ ID NO 326: 

(0 SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi*) SEQUENCE DESCRIPTION: SEQ ID NO:326: 

ATCCTTGATC 

(2) INFORMATION FOR SEQ ID NO:327: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0327: 
TCCTTGATCT 

(2) INFORMATION FOR SEQ ID NOS28: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

09 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0328: 

CCTTGATCTG 

(2) INFORMATION FOR SEQ ID N0329: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0329: 
CTTGATCTGT 

(2) INFORMATION FOR SEQ ID NO330: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(H) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO J 30: 
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TTGATCTGTG 

(2) INFORMATION FOR SEQ ID N0331: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:331: 
TGATCTGTGG 

(2) INFORMATION FOR SEQ ID NO:332: 

. (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0332: 

GATCTGTGGA 

(2) INFORMATION FOR SEQ ID NOJ33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(O STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:333: 

ATCTGTGGAT 

(2) INFORMATION FOR SEQ ID N0334: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

C") MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO J34: 
TCTGTGGATC 

(2) INFORMATION FOR SEQ ID NOS35: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 



(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:335: 
CTGTGGATCT 

(2) INFORMATION FOR SEQ ID NOJ36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:336: 
TGTGGATCTA 

(2) INFORMATION FOR SEQ ID N0337: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0337: 
GTGGATCTAC 

(2) INFORMATION FOR SEQ ID NO:338: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0338: 

TGGATCTACC 

(2) INFORMATION FOR SEQ ID NO:339: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:339: 
GGATCTACCA 
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(2) INFORMATION FOR SEQ ID NO:340: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO 340: 

GATCTACCAC 

(2) INFORMATION FOR SEQ ID N0341: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:341: 

ATCTACCACA 

(2) INFORMATION FOR SEQ ID N0342: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(H) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0342: 

TCTACCACAC 

(2) INFORMATION FOR SEQ ID N0343: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

OS) MOLECULE TVPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:343: 
CTACCACACA 

(2) INFORMATION FOR SEQ ID NO:344: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:344: 
TACCACACAC 

(2) INFORMATION FOR SEQ ID NO:345: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

<xQ SEQUENCE DESCRIPTION: SEQ ID NOS45: 
ACCACACACA 

(2) INFORMATION FOR SEQ ID N0346: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE* nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ IDN0346: 
CCACACACAA 

(2) INFORMATION FOR SEQ ID N0347: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0347: 

CACACACAAG 

(2) INFORMATION FOR SEQ ID N0348: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:348: 
ACACACAAGG 

(2) INFORMATION FOR SEQ ID NO:349: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xO SEQUENCE DESCRIPTION: SEQ ID NO:349: 

CACACAAGGC 

(2) INFORMATION FOR SEQ ID NO:350: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO350: 
ACACAAGGCT 

(2) INFORMATION FOR SEQ ID N0351: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE* nucleic acid 

(Q STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECUUE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NOJ51: 

CACAAGGCTA 

(2) INFORMATION FOR SEQ ID N0352: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO*352: 
ACAAGGCTAC 

(2) INFORMATION FOR SEQ ID N0353: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(O STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:353: 
CAAGGCTACT 

(2) INFORMATION FOR SEQ ID NO:354: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0354: 
AAGGCTACTT 

(2) INFORMATION FOR SEQ ID N0355: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0355: 
AGGCTACTTC 

(2) INFORMATION FOR SEQ ID NO-356: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

09 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0356: 

GGCTACTTCC 

(2) INFORMATION FOR SEQ ID NO*357: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE* nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) sequence description: seq id no:357: 
gctacitccc 

(2) INFORMATION FOR SEQ ID N0358: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) sequence description: seq id no:358: 
ctacttccct 

(2) information for seq id no 359: 

CO SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0359: 

TACriCCCTG 

(2) INFORMATION FOR SEQ ID NO360: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:360: 
ACTTCCCTGA 

(2) INFORMATION FOR SEQ ID N0361 : 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:361: 

CTTCCCTGAT 

(2) INFORMATION FOR SEQ ID NO:362: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0362: 
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TTCCCTGATT 

(2) INFORMATION FOR SEQ ID N0363: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0363: 
TCCCTGATTG 

(2) INFORMATION FOR SEQ ID NO:364: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:364: 

CCCTGATTGG 

(2) INFORMATION FOR SEQ ID NO*365: . 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(iD MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID N0365: 
CCTGATTGGC 

(2) INFORMATION FOR SEQ ID N0366: 

fi) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE* nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0366: 

CTGATTGGCA 

(2) INFORMATION FOR SEQ ID NO*367: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE* nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xQ SEQUENCE DESCRIPTION: SEQ ID NO:367: 
TGATTGGCAG 

(2) INFORMATION FOR SEQ ID N0368: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDNO:368: 

GATTGGCAGA 

(2) INFORMATION FOR SEQ ID NOS69: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

flO MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0369: 

ATTGGCAGAA 

(2) INFORMATION FOR SEQ ID NO370: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO370: 

TTGGCAGAAC 

(2) INFORMATION FOR SEQ ID N0371: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xO SEQUENCE DESCRIPTION: SEQ ID NOJ71: 

TGGCAGAACT 



(2) INFORMATION FOR SEQ ID N0372: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:372: 

GGCAGAACTA 

(2) INFORMATION FOR SEQ ID NOS73: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(x0 SEQUENCE DESCRIPTION: SEQ ID N0373: 

GCAGAACTAC 

(2) INFORMATION FOR SEQ ID NOS74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(x0 SEQUENCE DESCRIPTION: SEQ ID NO:374: 
CAGAACTACA 

(2) INFORMATION FOR SEQ ID NOS75: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(x0 SEQUENCE DESCRIPTION: SEQ ID NO:375: 

AGAACTACAC 

(2) INFORMATION FOR SEQ ID N0376: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY; linear 
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(ti) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:376: 
GAACTACACA 

(2) INFORMATION FOR SEQ ID N0377: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:377: 

AACTACACAC 

(2) INFORMATION FOR SEQ ID N0378: 

CO SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0378: 

ACTACACACC 

(2) INFORMATION FOR SEQ ID NO:379: 

© SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID N0379: 
CTACACACCA 

(2) INFORMATION FOR SEQ ID NO380: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:380: 
TACACACCAG 

(2) INFORMATION FOR SEQ ID N0381: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0381: 
ACACACCAGG 

(2) INFORMATION FOR SEQ ID NO:382: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(O STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0382: 

CACACCAGGG 

(2) INFORMATION FOR SEQ ID NO J 83: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0383: 

ACACCAGGGC 

(2) INFORMATION FOR SEQ ID N0384: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECUUE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NOS84: 
CACCAGGGCC 

(2) INFORMATION FOR SEQ ID N0385: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS : single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:385: 
ACCAGGGCCA 

(2) INFORMATION FOR SEQ ID N0386: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

0i) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:386: 

CCAGGGCCAG 

(2) INFORMATION FOR SEQ ID NO:387: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA . 
(xi) SEQUENCE DESCRIPTION: SEQ ID N0387: 
CAGGGCCAGG 

(2) INFORMATION FOR SEQ ID NO:388: 

G) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0388: 

AGGGCCAGGG 

(2) INFORMATION FOR SEQ ID N0389: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:389: 

GGGCCAGGGG 

(2) INFORMATION FOR SEQ ID NO:390: 

(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 10 base pain 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ IDNO390: 
GGCCAGGGGT 

(2) INFORMATION FOR SEQ ID NO:391: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:391: 
GCCAGGGGTC 

(2) INFORMATION FOR SEQ ID N0392: 

CO SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
P) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0392: 

CCAGGGGTCA 

(2) INFORMATION FOR SEQ ID N0393: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(O STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0393: 
CAGGGGTCAG 

(2) INFORMATION FOR SEQ ID N0394: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:394: 
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AGGGGTCAGA 

(2) INFORMATION FOR SEQ ID NO:395: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(u) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0395: 

GGGGTCAGAT 

(2) INFORMATION FOR SEQ ID N0396: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO 396: 

GGGTCAGATA 

(2) INFORMATION FOR SEQ ID N0397: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(u) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NC*397: 

GGTCAGATAT 

(2) INFORMATION FOR SEQ ID N0398: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:398: 

GTCAGATATC 

(2) INFORMATION FOR SEQ ID NOS99: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 



(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:399: 
TCAGATATCC 

(2) INFORMATION FOR SEQ ID NO:400: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:400: 
CAGATATCCA 

(2) INFORMATION FOR SEQ ID NO:401: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:401: 

AGATATCCAC 

(2) INFORMATION FOR SEQ ID NO:402: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:402: 

GATATCCACT 

(2) INFORMATION FOR SEQ ID NO:403: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE 1YPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:403: 

ATATOCACTG 
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(2) INFORMATION FOR SEQ ID NO:404: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

Gi) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:404: 
TATCCACTGA 

(2) INFORMATION FOR SEQ ID NO:405: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(U) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:405: 

ATCCACTGAC 

(2) INFORMATION FOR SEQ ID NO:406: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:406: 

TCCACTGACC 

(2) INFORMATION FOR SEQ ID NO:407: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:407: 
CCACTGACCT 

(2) INFORMATION FOR SEQ ID NO:408: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:408: 
CACTGACCTT 

(2) INFORMATION FOR SEQ ID NO:409: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(O STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(U) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:409: 

ACTGACCTTT 

(2) INFORMATION FOR SEQ ID NO:4I0: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:410: 
CTGACCTTTG 

(2) INFORMATION FOR SEQ ID NO:41 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ IDNO:411: . 
TGACCTTTGG 

(2) INFORMATION FOR SEQ ID NO:412: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

fii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:412: 

GACCTTTGGA 

(2) INFORMATION FOR SEQ ID NO:413: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(O STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:413: 

ACCTTTGGAT 

(2) INFORMATION FOR SEQ ID NO:414: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:414: 
CCTTTGGATG 

(2) INFORMATION FOR SEQ ID NO:41S: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(O STRANDEDNESS: single 
(D) TOPOLOGY: linear 

fii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:415: 

CTTTGGATGG 

(2) INFORMATION FOR SEQ ID NO:416: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:416: 

TTTGGATGGT 

(2) INFORMATION FOR SEQ ID NO:417: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4I7: 
TTGGATGGTG 

(2) INFORMATION FOR SEQ ID NO:418: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(O STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:418: 
TGGATGGTGC 

(2) INFORMATION FOR SEQ ID NO:419: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:419: 
GGATGGTGCT 

(2) INFORMATION FOR SEQ ID NO:420: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xQ SEQUENCE DESCRIPTION: SEQ ID NO:420: 

GATGGTGCTA 

(2) INFORMATION FOR SEQ ID NO:421: 

CO SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

- (xi) SEQUENCE DESCRIPTION: SEQ IDNO:421: 

ATGGTGCTAC 

(2) INFORMATION FOR SEQ ID NO:422: 
(0 SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLjOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:422: 

TGGTGCTACA 

(2) INFORMATION FOR SEQ ID NO:423: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

CO MOLECULE TYPE: DNA 

(xO SEQUENCE DESCRIPTION: SEQ ID NO:423: 

GGTGCTACAA 

(2) INFORMATION FOR SEQ ID NO:424: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:424: 

GTGCTACAAG 

(2) INFORMATION FOR SEQ ID NO:425: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:425: 

TGCTACAAGC 

(2) INFORMATION FOR SEQ ID NO:426: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

GO MOLECULE TYPE: DNA 

(xi*) SEQUENCE DESCRIPTION: SEQ ID NO:426: 
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GCTACAAGCT 

(2) INFORMATION FOR SEQ ID NO:427: 

O) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS : single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(x0 SEQUENCE DESCRIPTION: SEQ ID NO:427: 

CTACAAGCTA 

(2) INFORMATION FOR SEQ ID NO:428: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:428: 

TACAAGCTAG 

(2) INFORMATION FOR SEQ ID NO:429: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:429: 
ACAAGCTAGT 

(2) INFORMATION FOR SEQ ID NO:430: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:430: 

CAAGCIAGTA 

(2) INFORMATION FOR SEQ ID NO:431: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 
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(Q STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:431: 

AAGCTAGTAC 

(2) INFORMATION FOR SEQ ID NO:432: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:432: 

AGCTAGTACC 

C2) INFORMATION FOR SEQ ID NO:433: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(H) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:433: 

GCTAGTACCA 

(2) INFORMATION FOR SEQ ID NO:434: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

<xQ SEQUENCE DESCRIPTION: SEQ ID NO:434: 

CTAGTACCAG 

(2) INFORMATION FOR SEQ ID NO:435: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE* nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:435: 

TAGTACCAGT 



(2) INFORMATION FOR SEQ ID NO:436: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE- nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE DNA 

(xi*) SEQUENCE DESCRIPTION: SEQ ID NO:436: 
AGTACCAGTT 

(2) INFORMATION FOR SEQ ID NO:437: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:437: 
GTACCAGTTG 

(2) INFORMATION FOR SEQ ID NO:438: 

CO SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(u) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:438: 
TACCAGTTGA 

(2) INFORMATION FOR SEQ ID NO:439: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:439: 
ACCAGTTGAG 

(2) INFORMATION FOR SEQ ID NO:440: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:440: 
CCAGTTGAGC 

(2) INFORMATION FOR SEQ ID NO:441: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:44l: 

CAGTTGAGCC 

(2) INFORMATION FOR SEQ ID NO:442: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(U) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:442: 

AGTTGAGCCA 

(2) INFORMATION FOR SEQ ID NO:443: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(H) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:443: 
GTTGAGCCAG 

(2) INFORMATION FOR SEQ ID NO:444: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:444: 

TTGAGCCAGA 

(2) INFORMATION FOR SEQ ID NO:445: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(O STRANDEDNESS : single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:445: 

TGAGCCAGAT 

(2) INFORMATION FOR SEQ ID NO:446: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(U) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:446: 

GAGCCAGATA 

(2) INFORMATION FOR SEQ ID NO:447: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:447: 

AGCCAGATAA 

(2) INFORMATION FOR SEQ ID NO:448: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(x0 SEQUENCE DESCRIPTION: SEQ ID NO:44S: 

GCCAGATAAG 

(2) INFORMATION FOR SEQ ID NO:449: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:449: 
CCAGATAAGG 

(2) INFORMATION FOR SEQ ID NO:450: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:450: 
CAGATAAGGT 

(2) INFORMATION FOR SEQ ID NO:451: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:45I: 

AGATAAGGTA 

(2) INFORMATION FOR SEQ ID NO:452: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:452: 
GATAAGGTAG 

(2) INFORMATION FOR SEQ ID NO:453: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:453: 

ATAAGGTAGA 

(2) INFORMATION FOR SEQ ID NO:454: 

(i) SEQUENCE CHARACTERISTICS : 



(A) LENGTH: 10 base pain 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:454: 
TAAGGTAGAA 

(2) INFORMATION FOR SEQ ID NO:455: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:455: 

AAGGTAGAAG 

(2) INFORMATION FOR SEQ ID NO:456: 

(0 SEQUENCE CHARACTERISTICS: 
• (A) LENGTH: 10 base pairs 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xi*) SEQUENCE DESCRIPTION: SEQ ID NO:456: 
AGGTAGAAGA 

(2) INFORMATION FOR SEQ ID NO:457: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(x0 SEQUENCE DESCRIPTION: SEQ IDNO:457: 
GGTAGAAGAG 

(2) INFORMATION FOR SEQ ID NO:458: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE* nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi*) SEQUENCE DESCRIPTION: SEQ ID NO:458: 
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GTAGAAGAGG 

(2) INFORMATION FOR SEQ ID NO:459: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:459: 
TAGAAGAGGC 

(2) INFORMATION FOR SEQ ID NO:460: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

Ci) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:460: 

AGAAGAGGCC 

(2) INFORMATION FOR SEQ ID NO:461: 

0) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:461: 

GAAGAGGCCA 

(2) INFORMATION FOR SEQ ID NO:462: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

GO MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:462: 
AAGAGGCCAA 

(2) INFORMATION FOR SEQ ID NO:463: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 



(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:463: 

AGAGGCCAAT 

(2) INFORMATION FOR SEQ ID NO:464: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(iD MOLECULE TYPE: DNA 
(xO SEQUENCE DESCRIPTION: SEQ ID NO:464: 
GAGGCCAATA 

(2) INFORMATION FOR SEQ ID N 0:46 5: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(iO MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:465: 
AGGCCAATAA 

(2) INFORMATION FOR SEQ ID NO:466: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:466: 

GGCCAATAAA 

(2) INFORMATION FOR SEQ ID NO:467: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi*) SEQUENCE DESCRIPTION: SEQ ID NO:467: 

GCCAATAAAG 
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(2) INFORMATION FOR SEQ ID NO:468: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:468: 

CCAATAAAGG 

(2) INFORMATION FOR SEQ ID NO:469: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:469: 

CAATAAAGGA 

(2) INFORMATION FOR SEQ ID NO:470: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:470: 
AATAAAGGAG 

(2) INFORMATION FOR SEQ ID NO:471: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:47i: 

ATAAAGGAGA 

(2) INFORMATION FOR SEQ ID NO:472: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:472: 
TAAAGGAGAG 

(2) INFORMATION FOR SEQ ID NO:473: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:473: 
AAAGGAGAGA 

(2) INFORMATION FOR SEQ ID NO:474: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:474: 

AAGGAGAGAA 

(2) INFORMATION FOR SEQ ID NO:475: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:475: 

AGGAGAGAAC 

(2) INFORMATION FOR SEQ ID NO:476: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:476: 

GGAGAGAACA 

(2) INFORMATION FOR SEQ ID NO:477: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:477: 
GAGAGAACAC 

(2) INFORMATION FOR SEQ ID NO:478: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:478: 

AGAGAACACC 

(2) INFORMATION FOR SEQ ID NO:479: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:479: 

GAGAACACCA 

(2) INFORMATION FOR SEQ ID NO:480: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:480: 

AGAACACCAG 

(2) INFORMATION FOR SEQ ID NO:481: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ IDNO:481: 
GAACACCAGC 

(2) INFORMATION FOR SEQ ID NO:482: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:482: 
AACACCAGCT 

(2) INFORMATION FOR SEQ ID NO:483: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:483: 
ACACCAGCTT 

(2) INFORMATION FOR SEQ ID NO:484: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xO SEQUENCE DESCRIPTION: SEQ ID NO:484: 

CACCAGCTTG 

(2) INFORMATION FOR SEQ ID NO:485: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:485: 
ACCAGCITGT 

(2) INFORMATION FOR SEQ ID NO:486: 
0) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS : single 
P) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:486: 

CCAGCTTGTT 

(2) INFORMATION FOR SEQ ID NO:487: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE* DNA 

(xO SEQUENCE DESCRIPTION: SEQ ID NO:487: 

CAGCTTGTTA 

(2) INFORMATION FOR SEQ ID NO:488: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:488: 

AGCTTGTTAC 

(2) INFORMATION FOR SEQ ID NO:489: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:489: 
GCTTGTTACA 

(2) INFORMATION FOR SEQ ID NO:490: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:490: 
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CTTGTTACAC 

(2) INFORMATION FOR SEQ ID NO:49I: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:491: 

TTGTTACACC 

(2) INFORMATION FOR SEQ ID NO:492: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(U) MOLECULE TYPE: DNA 
(xQ SEQUENCE DESCRIPTION: SEQ ID NO:492: 
TGTTACACCC 

(2) INFORMATION FOR SEQ ID NO:493: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(x0 SEQUENCE DESCRIPTION: SEQ ID NO:493: 

GTTACACCCT 

(2) INFORMATION FOR SEQ ID NO:494: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:494: 
TTACACCCTG 

(2) INFORMATION FOR SEQ ID NO:495: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:495: 
TACACCCTGT 

(2) INFORMATION FOR SEQ ID NO:496: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:496: 

ACACCCTGTG 

(2) INFORMATION FOR SEQ ID NO:497: 

CO SEQUENCE CHARACTERISTICS: 

(A) LENGTH* 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:497: 

CACCCTGTGA 

(2) INFORMATION FOR SEQ ID NO:498: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:498: 
ACOCTGTGAG 

(2) INFORMATION FOR SEQ ID NO:499: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xO SEQUENCE DESCRIPTION: SEQ ID NO:499: 

CCCTGTGAGC 



(2) INFORMATION FOR SEQ ID NO:500: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(x0 SEQUENCE DESCRIPTION: SEQ ID NO:500: 
CCTGTGAGCC 

(2) INFORMATION FOR SEQ ID NO:501: 

0) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(O STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:501: 

CTGTGAGCCT 

(2) INFORMATION FOR SEQ ID NO: 5 02: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

. (Q STRANDEDNESS: single 
(D) TOPOLOGY; linear 

(H) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:502: 

TGTGAGCCTC 

(2) INFORMATION FOR SEQ ID NO*503: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:503: 

GTGAGCCTGC 

(2) INFORMATION FOR SEQ ID NO:504: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(O STRANDEDNESS: single 
(D) TOPOLOGY: linear 
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GO MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:504: 
TGAGCCTGCA 

(2) INFORMATION FOR SEQ ID NO:505: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE* nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(u) MOLECULE TYPE: DNA 

(xO SEQUENCE DESCRIPTION: SEQ ID NO:505: 

GAGCCTGCAT 

(2) INFORMATION FOR SEQ ID NO:506: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:506: 

AGCCTGCATG 

(2) INFORMATION FOR SEQ ID NO:507: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:507: 

GCCTGCATGG 

(2) INFORMATION FOR SEQ ID NO:508: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:508: 

CCTGCATGGA 

(2) INFORMATION FOR SEQ ID NO:509: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:509: 
CTGCATGGAA 

(2) INFORMATION FOR SEQ ID NO:510: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:510: 
TGCATGGAAT 

(2) INFORMATION FOR SEQ ID NO:51 1: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(O STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:511: 

GCATGGAATG 

(2) INFORMATION FOR SEQ ID NO:512: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID N0312: 
CATGGAATGG 

(2) INFORMATION FOR SEQ ID NO:513: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:513: 
ATGGAATGGA 

(2) INFORMATION FOR SEQ ID NO:514: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:514: 
TGGAATGGAT 

(2) INFORMATION FOR SEQ ID NO:515: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5I5: 

GGAATGGATG 

(2) INFORMATION FOR SEQ ID NO:516: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(u) MOLECULE TYPE: DNA 
(xD SEQUENCE DESCRIPTION: SEQ ID NO:516: 
GAATGGATGA 

(2) INFORMATION FOR SEQ ID NO:517: 

fi) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(U) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ IDNO:517: 
AATGGATGAC 

(2) INFORMATION FOR SEQ ID NO:518: 

(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:518: 
ATGGATGACC 

(2) INFORMATION FOR SEQ ID NO:519; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5I9: 
TGGATGACCC 

(2) INFORMATION FOR SEQ ID NO:520: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:520: 
GGATOACCCT 

(2) INFORMATION FOR SEQ ID NO:521: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(O STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:52I: 
GATGACCCTG 

(2) INFORMATION FOR SEQ ID NO:522: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:522: 
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ATGACCCTGA 

(2) INFORMATION FOR SEQ ID NO:523: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(x0 SEQUENCE DESCRIPTION: SEQ ID N0323: 
TGACCCTGAG 

(2) INFORMATION FOR SEQ ED NO:524: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:524: 

GACCCTGAGA 

(2) INFORMATION FOR SEQ ID NO:525: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS : single 
(D) TOPOLOGY: linear 

Cu) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:525: 

ACCCTGAGAG 

(2) INFORMATION FOR SEQ ID NO:526: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:526: 
CCCTGAGAGA 

(2) INFORMATION FOR SEQ ID NO:527: 

0) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 



(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:527: 

CCTGAGAGAG 

(2) INFORMATION FOR SEQ ID NO:528: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(U) MOLECULE TYPE: DNA 
(xi*) SEQUENCE DESCRIPTION: SEQ ID NO:528: 
CTGAGAGAGA 

(2) INFORMATION FOR SEQ ID NO:529: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(x0 SEQUENCE DESCRIPTION: SEQ ID N0329: 

TGAGAGAGAA 

(2) INFORMATION FOR SEQ ID NO:530: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NOS30: 
GAGAGAGAAG 

(2) INFORMATION FOR SEQ ID NO:531: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(x0 SEQUENCE DESCRIPTION: SEQ ID NO:531: 

AGAGAGAAGT 
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(2) INFORMATION FOR SEQ ID N0.532: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:532: 
GAGAGAAGTG 

(2) INFORMATION FOR SEQ ID NO:533: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(H) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:533: 
AGAGAAGTGT 

(2) INFORMATION FOR SEQ ID NO:534: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:534: 

GAGAAGTGTT 

(2) INFORMATION FOR SEQ ID NO:535: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:535: 

AGAAGTGTTA 

(2) INFORMATION FOR SEQ ID NO:536: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:536: 
GAAGTGTTAG 

(2) INFORMATION FOR SEQ ID NO:537: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:537: 
AAGTGTTAGA 

(2) INFORMATION FOR SEQ ID NO:538: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

m MOLECULE TYPE: DNA 

(x0 SEQUENCE DESCRIPTION: SEQ ID NO:538: 

AGTGTTAGAG 

(2) INFORMATION FOR SEQ ID NO:539: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:539: 
GTGTTAGACT 

(2) INFORMATION FOR SEQ ID NO:540: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:540: 
TGTTAGAGTG 

(2) INFORMATION FOR SEQ ID NO:541: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:541: 

GTTAGAGTGG 

(2) INFORMATION FOR SEQ ID NO:542: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE; DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:542: 
TTAGAGTGGA 

(2) INFORMATION FOR SEQ ID NO:543: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

09 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:543: 

TAGAGTGGAG 

(2) INFORMATION FOR SEQ ID NO:544: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

09 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:544: 
AGAGTGGAGG 

(2) INFORMATION FOR SEQ ID NO:545: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

09 MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:545: 
GAGTGGAGGT 

(2) INFORMATION FOR SEQ ID NO:546: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) .TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(U) MOLECULE TYPE: DNA 
(xO SEQUENCE DESCRIPTION: SEQ ID NO:546: 
AGTGGAGGTT 

(2) INFORMATION FOR SEQ ID NO:547: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:547: 
GTGGAGGTTT 

(2) INFORMATION FOR SEQ ID NO:548: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(U) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:548: 
TGGAGGTTTG 

(2) INFORMATION FOR SEQ ID NO:549: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:549: 

GGAGGTTTGA 

(2) INFORMATION FOR SEQ ID NO:550: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

GO MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:550: 

GAGGTTTGAC 

(2) INFORMATION FOR SEQ ID NO:55I: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

GO MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:551: 

AGGTTTGACA 

(2) INFORMATION FOR SEQ ID NO:5S2: 

G) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

GO MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:552: 
GGTTTGACAG 

(2) INFORMATION FOR SEQ ID NO:553: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

GO MOLECULE TYPE: DNA 

(x0 SEQUENCE DESCRIPTION: SEQ ID NO:553: 

GTTTGACAGC 

(2) INFORMATION FOR SEQ ID NO:554: 

G) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

GO MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:554: 
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TTTGACAGCC 

(2) INFORMATION FOR SEQ ID NO:555: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

GO MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:555: 

TTGACAGCCG 

(2) INFORMATION FOR SEQ ID NO:556: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE nucleic acid 

(O STRANDEDNESS: single 
(D) TOPOLOGY: linear 

GO MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:556: 

TGACAGCCGC 

(2) INFORMATION FOR SEQ ID NO:557: 

G) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

GO MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:557: 

GACAGCCGCC 

(2) INFORMATION FOR SEQ ID NO:558: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

GO MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:558: 
ACAGCCGCCT 

(2) INFORMATION FOR SEQ ID NO:559: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 
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(Q STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:559: 

CAGCCGCCTA 

(2) INFORMATION FOR SEQ ID NO:560: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:560: 
AGCCGCCTAG 

(2) INFORMATION FOR SEQ ID NO:561: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:561: 

GCCGCCTAGC 

(2) INFORMATION FOR SEQ ID NO:562: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: stogie 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID N0362: 
CCGCCTAGCA 

(2) INFORMATION FOR SEQ ID NO:563: 

0) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ED NO:563: 

CGCCTAGCAT 



(2) INFORMATION FOR SEQ ID NO:564: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:564: 
GCCTAGCATT 

(2) INFORMATION FOR SEQ ID NO:565: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

09 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:565: 

CCTAGCATTT 

(2) INFORMATION FOR SEQ ID NO:566: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:566: 

CTAGCATTTC 

(2) INFORMATION FOR SEQ ID N0367: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

fii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:567: 

TAGCATTTCA 

(2) INFORMATION FOR SEQ ID NO:568: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 
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00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:568: 
AGCATTTCAT 

(2) INFORMATION FOR SEQ ID NO:569: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECUUE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:569: 

GCATTTCATC 

(2) INFORMATION FOR SEQ ID NO:570: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:570: 
CATTTCATCA 

(2) INFORMATION FOR SEQ ID NO:571 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:57I: 

ATTTCATCAC 

(2) INFORMATION FOR SEQ ID NO:572: 

CO SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:572: 
TTTCATCACG 

(2) INFORMATION FOR SEQ ID NO:573: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:573: 
TTCATCACGT 

(2) INFORMATION FOR SEQ ID NO:574: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

GO MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:574: 

TCATCACGTG 

(2) INFORMATION FOR SEQ ID NO:575: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q. STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDNO:575: 

CATCACGTGG 

(2) INFORMATION FOR SEQ ID NO:576: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:576: 

ATCACGTGGC 

(2) INFORMATION FOR SEQ ID NO:577: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:577: 
TCACGTGGCC 

(2) INFORMATION FOR SEQ ID NO:578: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:578: 

CACGTGGCCC 

(2) INFORMATION FOR SEQ ID NO:579: 

fi) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:579: 

ACGTGGCCCG 

(2) INFORMATION FOR SEQ ID NO:580: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:580: 
CGTGGCCCGA 

(2) INFORMATION FOR SEQ ID NO*J81 : 

(0 SEQUENCE CHARACTERISTICS ; 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:581: 

GTGGCCCGAG 

(2) INFORMATION FOR SEQ ID NO:582: 

(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(x0 SEQUENCE DESCRIPTION: SEQ ID NO:582: 
TGGCCCGAGA 

(2) INFORMATION FOR SEQ ID NO:583: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(iO MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:583: 

GGCCCGAGAG 

(2) INFORMATION FOR SEQ ID NO:584: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(x0 SEQUENCE DESCRIPTION: SEQ ID NO:584: 

GCCCGAGAGC 

(2) INFORMATION FOR SEQ ID NO:585: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:585: 
CCCGAGAGCT 

(2) INFORMATION FOR SEQ ID N0386: 

(0 SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:586: 
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CCGAGAGCTG 

(2) INFORMATION FOR SEQ ID N0.587: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:587: 

CGAGAGCTGC 

(2) INFORMATION FOR SEQ ID NO:588: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:588: 
GAGAGCTGCA 

(2) INFORMATION FOR SEQ ID NO:589: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDNO:589: 

AGAGCTGCAT 

(2) INFORMATION FOR SEQ ID NO:590: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:590: 

GAGCTGCATC 

(2) INFORMATION FOR SEQ ID NO:591: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 



(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:591: 

AGCTGCATCC 

(2) INFORMATION FOR SEQ ID NO:592: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:592: 

GCTGCATCCG 

(2) INFORMATION FOR SEQ ID NO:593: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:593: 
CTGCATCCGG 

(2) INFORMATION FOR SEQ ID NO:594: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(») SEQUENCE DESCRIPTION: SEQ IDNO:594: 

TGCATCCGGA 

(2) INFORMATION FOR SEQ ID NO:595: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:595: 
GCATCCGGAG 
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(2) INFORMATION FOR SEQ ID NO:596: 

0) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:S96: 
CATCCGGAGT 

(2) INFORMATION FOR SEQ ID NO:597: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:597: 
ATCCGGAGTA 

(2) INFORMATION FOR SEQ ID NO:598: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE* nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:598: 
TCCGGAGTAC 

(2) INFORMATION FOR SEQ ID NO-.599: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

Cu) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:599: 
CCGGAGTACT 

(2) INFORMATION FOR SEQ ID NO:600: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:600: 
CGGAGTACTT 

(2) INFORMATION FOR SEQ ID NO:601: 

0) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:601: 

GGAGTACTTC 

(2) INFORMATION FOR SEQ ID NO:602: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:602: 

GAGTACITCA 

(2) INFORMATION FOR SEQ ID NO:603: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 603: 
AGTACTTCAA 

(2) INFORMATION FOR SEQ ID NO:604: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

fu) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:604: 

GTACTTCAAG 

(2) INFORMATION FOR SEQ ID NO:605: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

fii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDNO:605: 

TACTTCAAGA 

(2) INFORMATION FOR SEQ ID NO:606: 

CO SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
<D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:606: 

ACTTCAAGAA 

(2) INFORMATION FOR SEQ ID NO:607: 

(t) SEQUENCE CHARACTERISTICS: . 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:607: 
CTTCAAGAAC 

(2) INFORMATION FOR SEQ ID NO:608: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO.608: 

TTCAAGAACT 



(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:609: 
TCAAGAACTG 

(2) INFORMATION FOR SEQ ID NO:610: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(x0 SEQUENCE DESCRIPTION: SEQ ID NO:610: 

CAAGAACTGC 

(2) INFORMATION FOR SEQ ID NO:61 1: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xO SEQUENCE DESCRIPTION: SEQ ID NO:6U: 

AAGAACTGCT 

(2) INFORMATION FOR SEQ ID NO;612: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:612: 
AGAACTGCTG 



(2) INFORMATION FOR SEQ ID NO:613: 

0) SEQUENCE CHARACTERISTICS: 
(2) INFORMATION FOR SEQ ID NO:609: (A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

0) SEQUENCE CHARACTERISTICS: (Q STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:613: 
GAACTGCTGA 
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(2) INFORMATION FOR SEQ ID NO: 614: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1305 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:614: 

GAAGAGATTT GGGAGAACAT GACCTGGATG CAGTGGGAAA AAGAAATTCA CAATCACACA 60 

AAATACATAT ACTCCTTACT TGAAAAATCG CAGAACCAAC AAGAAAAGAA TGAACAAGAA 120 

CTATTGGAAT TGGATCAATG GGCAAGTTTG TGGAATTGGT TTGACATAAC AAAATGGCTG 180 

TGGTATATAA AAATATTCAT AATGGTAGTA GGAGGCTTGA TAGGTTTAAG AATAGTTTTT 240 

GCTGTACTTT CTATAGTGAA TAGAGTTAGG CAGGGATACT CACCATTGTC GTTTCAGACC 300 

CTCCTCCCAA CCCCGAGGGG ACCCGACAGG CCCGAAGGAA TCGAAGAAGA AGGTGGAGAG 360 

AGAGACAGAG ACAGATCCAC TCGATTAGTA CACGGATTCT TAGCACTTTT CTGGGACGAC 420 

CTGAGGAGCC TGTGCCTCTT CCTCTACCAC CACTTGAGAG ACTTACTCTT GATTGTAACA 460 

AGGATTGTGG AACTTCTGGG ACGCAGGGGA TGGGAAGCCC TCAAATATTG GTGGAACCTC 540 

CTAAAGTATT GGAGCCAGGA ACTGCAGAAG AGTGCTGTTA TCTTGCTCAA TGCCACCGCC 600 

ATAGCAGTAG CTGAGGGGAC AGATAGAGTT TTAGAAGTAT TACAAAGAGC TTATAGAGCT 660 

ATCCTCCACA TACCTAGAAG AATAAGACAG GGCCTCGAAA TGGCTTTGCT ATAAAATGGG 720 

TGGCAAGTGA GCAAAAAGTA GTGTAGTCAG ATAGCATGCA TCATAAGGGG TGGGGGCCAA 780 

CAACTAACAA TGCTGATCGT GCCTGGCTAG AAGCACAAGA GAAGGAAGAA GCGGGTTTTC 840 

CAGTCAAACC TCAGGTAGCT GTAGATCTTA GCCACTTTTT AAAAGAAAAG GGGGGACTGG 900 

AAGGGCTAAT TCACTCCCAA AGAAGACAAG ATACACAGTG CTGCAAACTA TTACCAGTGG 960 

AGTCAGCGAA GATAGAAGAG GCCAATGGAG GAGAAAACCA CAGATTGTTC TGTTGGGGAC 1020 

TTTCCATCCG TTGGGGACTT TCCAAGGCGG CGTGGCCTGG GTGACTAGTT CCGGTGGGGA 1080 

CTTTCCAAGA AGGCGCGGCC TGGGCGGGAC TGGGGAGTGG CGAGCCCTCA GATGCTGCAT 1140 

ATAAGCAGCT GCTTTCTGCT GTTACTGGGT CTCTCGGGTT AGACCAGATC TGAGCCTGGG 120.0 

AGCTCTCTGG CTAACTAGGG AACCCACTGC TTAAGCCTCA ATAAAGCTTG CCTTGAGTGC 1260 

TTCAAGTAGT GTGTGCCCGT CTGTTGTGTG ACTCTGGTAT CTAGA 1305 
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(2) INFORMATION FOR SEQ ID NO: 615: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1208 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:615: 

GAAACAATTT GGGATAACAT GACCTGGATG CAGTGGGAAA GAGAAATTGA CAATTACACA 60 

AACATAATAT ACACCTTAAT TGAAGAATCG CAGAACCAAC AAGAAAAAAA TGAACTAGAA 120 

TTATTGGAAT TGGATAAATG GGCAAATTTG TGGAATTGGT TTAGTATATC AAACTGGCTA 180 

TGGTATATAA AATTATTCAT AATGGTAGTA GGAGGCTTGG TAGGTTTAAG AATAGTTTTT 240 

ACTGTACTTT CTATAGTGAA TAGAGTTAGG CAGGGATACT CACCATTGTC GTTTCAGACC 300 

CACCTCCCAA CCCCGAAGGG ACCCGACAGG CCAGAAGGAA TCGAAGAAGA AGGTGGAGAG 360 

AGAGACAGAG GCAGCTCCAC TCGATTAGTG CACGGATTCT TAGCACTTTT CTGGGACGAC 420 

CTGAGGAGTC TGTGCCTCTT CAGCTACCAC CACTTGAGAG ACTTACTCTT GATTGTAACG 480 

AGGATTGTGG AACTTCTGGG ACGCAGGGGA TGGGAAGCCC TCAAATACTG GTGGAATCTC 540 

CTGCAGTATT GGAGGCAGGA ACTACAGAAG AGTGCTGTTA GCTTGTTCAA TGGCACGGCC 600 

ATAGCAGTAG CTGAGGGGAC AGATAGAGTT ATAGAAGCTT TACGAAGGGC TTATAGAGCT 660 

ATTCTCCACA TACCTAGAAG AATAAGACAG GGCTTAGAAA GGGCTTTGCT ATAAAATGGG 720 

TGGCAAGTGG TCAGAAAGTA GTGTGGTTAG AAGGCATGTA CCTTTAAGAC AAGGCAGCTA 780 

TAGATCTTAG CCGCTTTTTA AAAGAAAAGG GGGGACTGGA AGGGCTAATT CACTCACAGA 840 

GAAGATCAGT TGAACCAGAA GAAGATAGAA GAGGCCATGA AGAAGAAAAC AACAGATTGT 900 

TCCGTTTGTT CCGTTGGGGA CTTTCCAGGA GACGTGGCCT GAGTGATAAG CCGCTGGGGA 960 

CTTTCCGAAG AGGCGTGACG GGACTTTCCA AGGCGACGTG GCCTGGGCGG GACTGGGGAG 1020 

TGGCGAGCCC TCAGATGCTG CATATAAGCA GCTGCTTTCT GCCTGTACTG GGTCTCTCTG 1080 

GTTAGACCAG ATCTGAGCCT GGGAGCTCTC TGGCTAACTA GGGAACCCAC TGCTTAAGCC 1140 

TCAATAAAGC TTGCCTTGAG TGCTTCAAGT AGTGTGTGCC CGTCTGTTGT GTGACTCTGG 1200 

TATCTAGA 1208 
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(2) INFORMATION FOR SEQ ID NO:616: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:616: 

TGGAAGGGCT AATTTGGT 1 g 



(2) INFORMATION FOR SEQ ID NO:617: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE* DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:617: 

ATCTTCCCTA AAAAATTAGC CTGTC 25 



(2) INFORMATION FOR SEQ ID NO:61 8: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

GD MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:618: 
AGGCTCAGAT CTGGTCTAAC 20 



(2) INFORMATION FOR SEQ ID NO:619: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(O STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:619: 



AGCAGCAGGA AGCACTATGG 



20 
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(2) INFORMATION FOR SEQ ID NO:620: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pain 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECUUE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:620: 

TGCTAGAGAT TTTCCACAC 19 



(2) INFORMATION FOR SEQ ID NO:621: 

CD SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:621: 

AGTGAATAGA GTTAGGCAGG 20 



(2) INFORMATION FOR SEQ ID NO:622: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE- nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:622: 

GTAAGACAGT ATGATCAGAT A 21 



(2) INFORMATION FOR SEQ ID NO:623: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:623: 



TTGTAGGGAA TTCCAAATTC C 



21 



WO 95/21912 



PCT/AU95/00063 



- 152 - 



(2) INFORMATION FOR SEQ ID NO:624: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:624: 
CAGGATCCTA CACCTGTCAA CATAAT 26 



(2) INFORMATION FOR SEQ ID NO:625: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:625: 
GGGAATTCCT TATTCCTGCT TG 22 



(2) INFORMATION FOR SEQ ID NO:626: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:626: 

CCAGAAGTTC CACAATCC 18 



(2) INFORMATION FOR SEQ ID NO:627: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nncleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:627: 



TTCTTCTAGG TATGTGGAG 



19 
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(2) INFORMATION FOR SEQ ID NO:628: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:628: 
AGTGAATTAG CCCTTCCAG 19 



(2) INFORMATION FOR SEQ ID NO:629: 

(0 SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

QS) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:629: 

TGCTAGAGAT TTTCCACAC 19 



(2) INFORMATION FOR SEQ ID NO:630: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(U) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:630: 

TGCTCTGGAA AACTCAT \l 



(2) INFORMATION FOR SEQ ID NO:631: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:631: 



CTTTCTATAG TGAATAGAG 



19 
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(2) INFORMATION FOR SEQ ID NO: 632: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:632: 
TATTGGAGTC AGGAACT 



(2) INFORMATION FOR SEQ ID NO: 633: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 17 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 633: 
GGTCTAACCA GAGAGAC 



(2) INFORMATION FOR SEQ ID NO: 634: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 634 : 

Ala Val Arg Glu Arg Met Arg Arg Ala Glu Pro Ala Ala 
15 io 



(2) INFORMATION FOR SEQ ID NO: 635: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 36 amino acids 
(B) TYPE: amino acid 
(O- STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 635: 

Pro Thr Ser Gin Ser Arg Gly Asp Pro Thr Gly Pro Lys Glu Lys 
1 5 * 10 15 

Lys Lys Val Glu Arg Glu Thr Glu Thr Asp Pro Phe Asp Thr 
20 25 30 

Asp Pro His 
35 
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(2) INFORMATION FOR SEQ ID NO: 636: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:636: 

Pro Ser Ser Gin Pro Arg Gly Asp Pro Thr Gly Pro Lys Glu Ser Lys 
15 10 15 

Lys Lys Val Glu Arg Glu Thr Glu Thr Asp Pro Leu Asp Tyr Thr 
20 25 30 

Asp Ser His 
35 



(2) INFORMATION FOR SEQ ID NO: 63 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63 7: 

Pro Thr Ser Gin Pro Arg Arg Asp Pro Thr Gly Gin Lys Glu Ser Lys 
1 5 10 15 

Lys Lys Val Glu Arg Glu Thr Glu Ala Ala Pro Leu Asp Cys Thr 
20 25 30 

Asp Ser His 
35 



(2) INFORMATION FOR SEQ ID NO: 638: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 110 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:638: 

Asp Pro Pro Pro Asn Pro Glu Gly Thr Arg Gin Ala Arg Arg Asn Arg 
15 10 15 

Arg Arg Arg Trp Arg Glu Arg Gin Arg Gin lie His Ser lie Ser Glu 
20 25 30 

Arg He Leu Ser Thr Tyr Leu Gly Arg Ser Ala Glu Pro Val Pro Leu 
35 40 45 

Gin Leu Pro Pro Leu Glu Arg Leu Thr Leu Asp Cys Asn Glu Asp Cys 
50 55 60 

Gly. Thr Ser Gly Thr Gin Gly Val Gly Ser Pro Gin He Leu Val Glu 
•65 70 75 80 
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Ser Pro Thr Val Leu Glu Ser Gly Thr Lys Glu Cys Cys Leu 

85 90 95 

Ala Gin Cys His Ser His Ser Ser Ser Gly Asp Arg 

100 105 110 



(2) INFORMATION FOR SEQ ID NO: 63 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 110 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 639: 

Asp Pro Pro Pro Asn Pro Glu Gly Thr Arg Gin Ala Arg Arg Asn Arg 
1 5 10 is 

Arg Arg Arg Trp Arg Glu Arg Gin Arg Gin He His Ser He Ser Thr 
20 25 30 

Arg He Leu Ser Thr Phe Leu Gly Arg Pro Glu Glu Pro Val Pro Leu 
35 40 45 

Pro Leu Pro Pro Leu Glu Arg Leu. Thr Leu Asp Cys Asn Lys Asp Cys 
50 55 60 

Gly Thr Ser Gly Thr Gin Gly Met Gly Ser Pro Gin He Leu Val Glu 
65 70 75 80 

Pro Pro Lys Val Leu Glu Pro Gly Thr Ala Glu Glu Cys Cys Tyr Leu 
85 90 95 

Ala Gin Cys His Arg His Ser Ser Ser Gly Asp Arg 

100 105 no 



(2) INFORMATION FOR SEQ ID NO: 640: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 110 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 640 : 

Asp Pro Pro Pro Asn Pro Glu Gly Thr Arg Gin Ala Arg Arg Asn Arg 
1 5 10 15 

Arg Arg Arg Trp Arg Glu Arg Gin Arg Gin Leu His Ser He Ser Ala 
20 25 30 

Arg He Leu Ser Thr Phe Leu Gly Arg Pro Glu Glu Ser Val Pro Leu 
35 40 45 

Gin Leu Pro Pro Leu Glu Arg Leu Thr Leu Asp Cys Asn Glu Asp Cvs 
50 55 60 

Gly Thr Ser Gly Thr Gin Gly Met Gly Ser Pro Gin He Leu Val Glu 
65 70 75 80 

Ser Pro Ala Val Leu Glu Ala Gly Thr Thr Glu Glu Cys Cys Leu 
85 90 95 v 
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Val Gin Trp His Gly His Ser Ser Ser Gly Asp Arg 
100 105 



110 



(2) INFORMATION FOR SEQ ID NO: 641: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 237 amino acids 
<B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 641 : 

Glu Gin lie Trp Asn Asn Met Thr Trp Met Glu Trp Asp Arg Glu lie 
1 5 10 15 

Asn Asn Tyr Thr Ser Leu lie His Ser Leu He Glu Glu Ser Gin Asn 
20 25 30 

Gin Gin Glu Lys Asn Glu Gin Glu Leu Leu Glu Leu Asp Lys Trp Ala 
35 40 45 

Ser Leu Trp Asn Trp Phe Asn He Thr Asn Trp Leu Trp Tyr He Lys 
SO 55 60 

Leu Phe He Met He Val Gly Gly Leu Val Gly Leu Arg He Val Phe 
65 70 75 60 

Ala Val Leu Ser He Val Asn Arg Val Arg Gin Gly Tyr Ser Pro Leu 
85 90 95 

Ser Phe Gin Thr His Leu Pro He Pro Arg Gly Pro Asp Arg Pro Glu 
100 105 110 

Gly He Glu Glu Glu Gly Gly Glu Arg Asp Arg Asp Arg Ser He Arg 
115 120 125 

Leu Val Asn Gly Ser Leu Ala Leu He Trp Asp Asp Leu Arg Ser Leu 
130 135 140 

Cys Leu Phe Ser Tyr His Arg Leu Arg Asp Leu Leu Leu He Val Thr 
145 150 155 160 

Arg He Val Glu Leu Leu Gly Arg Arg Gly Trp Glu Ala Leu Lys Tyr 
165 170 175 

Trp Trp Asn Leu Leu Gin Tyr Trp Ser Gin Glu Leu Lys Asn Ser Ala 
180 185 190 

Val Asn Leu Leu Asn Ala Thr Ala He Ala Val Ala Glu Gly Thr Asp 
195 200 20S 

Arg Val He Glu Val Leu Gin Ala Ala Tyr Arg Ala He Arg His He 
210 215 220 

Pro Arg Arg He Arg Gin Gly Leu Glu Arg He Leu Leu 



225 



230 



235 
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(2) INFORMATION FOR SEQ ID NO: 642: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 237 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:642: 

Glu Glu lie Trp Glu Asn Met Thr Trp Met Gin Trp Glu Lys Glu lie 
15 10 15 

His Asn His Thr Lys Tyr He Tyr Ser Leu Leu Glu Lys Ser Gin Asn 
20 25 30 

Gin Gin Glu Lys Asn Glu Gin Glu Leu Leu Glu Leu Asp Gin Trp Ala 
35 40 45 

Ser Leu Trp Asn Trp Phe Asp He Thr Lys Trp Leu Trp Tyr He Lys 
50 55 60 

He Phe He Met Val Val Gly Gly Leu He Gly Leu Arg He Val Phe 
65 70 75 80 

Ala Val Leu Ser He Val Asn Arg Val Arg Gin Gly Tyr Ser Pro Leu 
85 90 95 

Ser Phe Gin Thr Leu Leu Pro Thr Pro Arg Gly Pro Asp Arg Pro Glu 
100 105 110 

Gly He Glu Glu Met Gly Gly Glu Arg Asp Arg Asp Arg Ser Thr Arg 
115 120 125 

Leu Val His Gly Phe Leu Ala Leu Phe Trp Asp Asp Leu Arg Ser Leu 
130 135 140 

Cys Leu Phe Leu Tyr His His Leu Arg Asp Leu Leu Leu He Val Thr 
145 ISO 155 160 

Arg He Val Glu Leu Leu Gly Arg Arg Gly Trp Glu Ala Leu Lys Tyr 
165 170 175 

Trp Trp Asn Leu Leu Lys Tyr Trp Ser Gin Glu Leu Gin Lys Ser Ala 
180 185 190 

Val He Leu Leu Asn Ala Thr Ala He Ala Val Ala Glu Gly Thr Asp 
195 200 205 

Arg Val Leu Glu Val Leu Gin Arg Ala Tyr Arg Ala He Leu His He 
210 215 220 

Pro Arg Arg He Arg Gin Gly Leu Glu Met Ala Leu Leu 
225 230 235 
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(2) INFORMATION FOR SEQ ID NO:643: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 237 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 643: 

Glu Thr lie Trp Asp Asn Met Thr Trp Met Gin Trp Glu Arg Glu He 
15 10 15 

Asp Asn Tyr Thr Asn He He Tyr Thr Leu He Glu Glu Ser Gin Asn 
20 25 30 

Gin Gin Glu Lys Asn Glu Leu Glu Leu Leu Glu Leu Asp Lys Trp Ala 
35 40 45 

Asn Leu Trp Asn Trp Phe Ser He Ser Asn Trp Leu Trp Tyr He Lys 
50 55 60 

Leu Phe He Met Val Val Gly Gly Leu Val Gly Leu Arg He Val Phe 
65 70 75 60 

Thr Val Leu Ser He Val Asn Arg Val Arg Gin Gly Tyr Ser Pro Leu 
85 90 95 

Ser Phe Gin Thr His Leu Pro Thr Pro Lys Gly Pro Asp Arg Pro Glu 
100 105 no 

Gly He Glu Glu Glu Gly Gly Glu Arg Asp Arg Gly Ser Ser Thr Arg 
115 120 125 

Leu Val His Gly Phe Leu Ala Leu. Phe Trp Asp Asp Leu Arg Ser Leu 
.130 135 140 

Cys Leu Phe Ser Tyr His His Leu Arg Asp Leu Leu Leu He Val Thr 
145 150 155 160 

Arg He Val Glu Leu Leu Gly Arg Arg Gly Trp Glu Ala Leu Lys Tyr 
165 170 175 

Trp Trp Asn Leu Leu Gin Tyr Trp Arg Gin Glu Leu Gin Lys Ser Ala 
180 185 190 

Val Ser Leu Phe Asn Gly Thr Ala He Ala Val Ala Glu Gly Thr Asp 
195 200 205 

Arg Val He Glu Ala Leu Arg Arg Ala Tyr Arg Ala He Leu His He 
210 215 220 

Pro Arg Arg He Arg Gin Gly Leu Glu Arg Ala Leu Leu 
225 230 235 



WO 95/21912 



PCT/AU95/00063 



160- 



(2) INFORMATION FOR SEQ ID NO: 644: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 207 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 644: 

Met Gly Gly Lys Trp Ser Lys Ser Ser Val lie Gly Trp Pro Ala Val 
1 5 10 15 

Arg Glu Arg Met Arg Arg Ala Glu Pro Ala Ala Asp Gly Val Gly Ala 
20 25 30 

Val Ser Arg Asp Leu Glu Lys His Gly Ala He Thr Ser Ser Asn Thr 
35 40 45 

Ala Ala Asn Asn Ala Ala Cys Ala Trp Leu Glu Ala Gin Glu Glu Glu 
50 55 60 

Glu Val Gly Phe Pro Val Thr Pro Gin Val Pro Leu Arg Pro Met Thr 
65 70 75 80 

Tyr Lys Ala Ala Val Asp Leu Ser His Phe Leu Lys Glu Lys Gly Gly 
85 90 95 

Leu Glu Gly Leu He His Ser Gin Arg Arg Gin Asp He Leu Asp Leu 
100 105 HO 

Trp He Tyr His Thr Gin Gly Tyr Phe Pro Asp Trp Gin Asn Tyr Thr 
H5 120 125 

Pro Gly Pro Gly Val Arg Tyr Pro Leu Thr Phe Gly Trp Cys Tyr Lys 
130 135 140 

Leu Val Pro Val Glu Pro Asp Lys Val Glu Glu Ala Asn Lys Gly Glu 
14 5 150 155 160 

Asn Thr Ser Leu Leu His Pro Val Ser Leu His Gly Met Asp Asp Pro 
165 170 175 

Glu Arg Glu Val Leu Glu Trp Arg Phe Asp Ser Arg Leu Ala Phe His 
160 185 190 

His Val Ala Arg Glu Leu His Pro Glu Tyr Phe Lys Asn Cys 
195 200 205 



(2) INFORMATION FOR SEQ ID NO: 645: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 
(8) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 645: 

Met Gly Gly Lys Ala Lys Ser Ser Val Val Arg His Ala Ser 
1 5 io 15 

Gly Val Gly Ala Asn Asn Gin Cys 

20 25 
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(2) INFORMATION FOR SEQ ID NO:646: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 646: 

Met Gly Gly Lys Trp Ser Glu Ser Ser Val Val Arg Arg His Val Pro 
1 5 10 15 

Leu Arg Gin Gly Ser Tyr Arg Ser Pro Leu 
20 25 



(2) INFORMATION FOR SEQ ID NO: 647 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 74 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 647: 

CTACAAGGGA CTTTCCGCTG GGGACTTTCC AGGGAGGOGT GGCCTGGGCG GGACTGGGGA 60 
GTGGCGAGCC CTCA 74 



(2) INFORMATION FOR SEQ ID NO: 64 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 78 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 648: 

CTGTTGGGGA CTTTCCATCC GTTGGGGACT TTCCAAGGCG GCGTGGCCTG GGTGACTAGT 60 
TCCGGTGGGG ACTTTCCA 78 



(2) INFORMATION FOR SEQ ID NO: 649: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 67 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:649: 



CCGTTTGTTC CGTTGGGGAC TTTCCAGGAG ACGTGGCCTG AGTGACTAAG CCGCTGGGGA 60 
CTTTCCG 67 
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(2) INFORMATION FOR SEQ ID NO:6S0: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 621 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 650: 

ATGGGTGGCA AGTGGTCAAA AAGTAGTGTG ATTGGATGGC CTGCTGTAAG GGAAAGAATG 60 

AGACGAGCTG AGCCAGCAGC AGATGGGGTG GGAGCAGTAT CTCGAGACCT AGAAAAACAT 120 

GGAGCAATCA CAAGTAGCAA TACAGCAGCT AACAATGCTG CTTGTGCCTG GCTAGAAGCA 180 

CAAGAGGAGG AAGAGGTGGG TTTTCCAGTC ACACCTCAGG TACCTTTAAG ACCAATGACT 240 

TACAAGGCAG CTGTAGATCT TAGCCACTTT TTAAAAGAAA AGGGGGGACT GGAAGGGCTA 300 

ATTCACTCCC AAAGAAGACA AGATATCCTT GATCTGTGGA TCTACCACAC ACAAGGCTAC 360 

TTCCCTGATT GGCAGAACTA CACACCAGGG CCAGGGGTCA GATATCCACT GACCTTTGGA 420 

TGGTGCTACA AGCTAGTACC AGTTGAGCCA GATAAGGTAG AAGAGGCCAA TAAAGGAGAG 480 

AACACCAGCT TGTTACACCC TGTGAGCCTG CATGGAATGG ATGACCCTGA GAGAGAAGTG 540 

TTAGAGTGGA GGTTTGACAG CCGCCTAGCA TTTCATCACG TGGCCCGAGA GCTGCATCCG 600 

GAGTACTTCA AGAACTGCTG A 621 

(2) INFORMATION FOR SEQ ID NO:65l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1596 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 651: 

GAACAGATTT GGAATAACAT GACCTGGATG GAGTGGGACA GAGAAATTAA CAATTACACA 60 

AGCTTAATAC ACTCCTTAAT TGAAGAATCG CAAAACCAGC AAGAAAAGAA TGAACAAGAA 120 

TTATTGGAAT TAGATAAATG GGCAAGTTTG TGGAATTGGT TTAACATAAC AAATTGGCTG 180 

TGGTATATAA AATTATTCAT AATGATAGTA GGAGGCTTGG TAGGTTTAAG AATAGTTTTT 240 

GCTGTACTTT CTATAGTGAA TAGAGTTAGG CAGGGATATT CACCATTATC GTTTCAGACC 300 

CACCTCCCAA TCCCGAGGGG ACCCGACAGG CCCGAAGGAA TAGAAGAAGA AGGTGGAGAG 360 

AGAGACAGAG ACAGATCCAT TCGATTAGTG AACGGATCCT TAGCACTTAT CTGGGACGAT 420 

CTGCGGAGCC TGTGCCTCTT CAGCTACCAC CGCTTGAGAG ACTTACTCTT GATTGTAACG 480 

AGGATTGTGG AACTTCTGGG ACGCAGGGGG TGGGAAGCCC TCAAATATTG GTGGAATCTC 540 

CTACAGTATT GGAGTCAGGA ACTAAAGAAT AGTGCTGTTA ACTTGCTCAA TGCCACAGCC 600 

ATAGCAGTAG CTGAGGGGAC AGATAGGGTT ATAGAAGTAT TACAAGCAGC TTATAGAGCT 660 

ATTCGCCACA TACCTAGAAG AATAAGACAG GGCTTGGAAA GGATTTTGCT ATAAGATGGG 720 

TGGCAAGTGG TCAAAAAGTA GTGTGATTGG ATGGCCTGCT GTAAGGGAAA GAATGAGACG 780 
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AGCTGAGCCA GCAGCAGATG GGGTGGGAGC 
AATCACAAGT AGCAATACAG CAGCTAACAA 
GGAGGAAGAG GTGGGTTTTC CAGTCACACC 
GGCAGCTGTA GATCTTAGCC ACTTTTTAAA 
CTCCCAAAGA AGACAAGATA TCCTTGATCT 
TGATTGGCAG AACTACACAC CAGGGCCAGG 
CTACAAGCTA GTACCAGTTG AGCCAGATAA 
CAGCTTGTTA CACCCTGTGA GCCTGCATGG 
GTGGAGGTTT GACAGCCGCC TAGCATTTCA 
CTTCAAGAAC TGCTGACATC GAGCTTGCTA 
GAGGCGTGGC CTGGGCGGGA CTGGGGAGTG 
TGCTTTTTGC CTGTACTGGG TCTCTCTGGT 
GCTAACTAGG GAACCCACTG CTTAAGCCTC 
TGTGTGCCCG TCTGTTGTGT GACTCTGGTA 
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AGTATCTCGA GACCTAGAAA AACATGGAGC 


840 


TGCTGCTTGT 


GCCTGGCTAG 


AAGCACAAGA 


900 


TCAGGTACCT 


TTAAGACCAA 


TGACTTACAA 


960 


AGAAAAGGGG 


GGACTGGAAG 


GGCTAATTCA 


1020 


GTGGATCTAC 


CACACACAAG 


GCTACTTCCC 


1080 


GGTCAGATAT 


CCACTGACCT 


TTGGATGGTG 


1140 


GGTAGAAGAG 


GCCAATAAAG 


GAGAGAACAC 


1200 


AATGGATGAC 


CCTGAGAGAG 


AAGTGTTAGA 


1260 


TCACGTGGCC 


CGAGAGCTGC 


ATCCGGAGTA 


1320 


CAAGGGACTT 


TCCGCTGGGG 


ACTTTCCAGG 


1380 


GCGAGCCCTC AGATGCTGCA 


TATAAGCAGC 


1440 


TAGACCAGAT 


CTGAGCCTGG 


GAGCTCTCTG 


1500 


AATAAAGCTT GCCTTGAGTG 


CTTCAAGTAG 


1560 


ACTAGA 






1596 
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(2) INFORMATION FOR SEQ ID NO:652: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:652: 
GCTTTTTGCC 

(2) INFORMATION FOR SEQ ID NO:653: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE* nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:653: 
CTTTTTGCCT 

(2) INFORMATION FOR SEQ ID NO: 654: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xl) SEQUENCE DESCRIPTION: SEQ ID NO:654: 

TTTTTGCCTG 

(2) INFORMATION FOR SEQ ID NO:655: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

09 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDNO:655: 

TTTTGCCTGT 

(2) INFORMATION FOR SEQ ID NO:656: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:656: 
TTTGCCTGTA 

(2) INFORMATION FOR SEQ ID NO:657: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:657: 
TTGCCTGTAC 

(2) INFORMATION FOR SEQ ID NO:658: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOUECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:658: 
TGCCTGTACT 

(2) INFORMATION FOR SEQ ID NO:659: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:659: 
GCCTGTACTG 

(2) INFORMATION FOR SEQ ID NO:660: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

0i) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:660: 

CCTGTACTGG 
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(2) INFORMATION FOR SEQ ID NO:661: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ IDNO:661: 
CTGTACTGGG 

(2) INFORMATION FOR SEQ ID NO:662: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:662: 
TGTACTGGGT 

(2) INFORMATION FOR SEQ ID NO: 663: 

(Q SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nudeic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOI£CULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:663: 

GTACTGGGTC 

(2) INFORMATTON FOR SEQ ID NO:664: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(H) MOLECUUE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:664: 

TACTGGGTCT 

(2) INFORMATION FOR SEQ ID.NO:665: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:665: 
ACTGGGTCTC 

(2) INFORMATION FOR SEQ ID NO:666: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:666: 
CTGGGTCTCT 

(2) INFORMATION FOR SEQ ID NO:667: 

CO SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:667: 

TGGGTCTCTC 

(2) INFORMATION FOR SEQ ID NO;668: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:668: 
GGGTCTOCT 

(2) INFORMATTON FOR SEQ ID NO:669: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:669: 
GGTCTCTCTG 

(2) INFORMATION FOR SEQ ID NO:670: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:670: 

GTCTCTCTGG 

(2) INFORMATION FOR SEQ ID NO:671: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDNO:671: 
TCICTCTGGT 

(2) INFORMATION FOR SEQ ID NO:672: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY; linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:672: 

CTCTCTGGTT 

(2) INFORMATION FOR SEQ ID NO:673: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:673: 
TCTCTGGTTA 

(2) INFORMATION FOR SEQ ID NO:674: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:674: 
CTCTGGTTAG 

(2) INFORMATION FOR SEQ ID NO:675: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:675: 

TCTCTGGTTA 

(2) INFORMATION FOR SEQ ID NO:676: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:676: 

CTGGTTAGAC 

(2) INFORMATION FOR SEQ ID NO:677: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(x0 SEQUENCE DESCRIPTION: SEQ ID NO:677: 

TGGTTAGACC 

(2) INFORMATION FOR SEQ ID NO;678; 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:678: 
GGTTAGACCA 

(2) INFORMATION FOR SEQ ID NO:679: 
(0 SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(H) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:679: 
GTTAGACCAG 



(2) INFORMATION FOR SEQ ID NO:680: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:680: 
TTAGACCAGA 



(2) INFORMATION FOR SEQ ID NO:68 1 : 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(n) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:681: 

TAGACCAGAT 



(2) INFORMATION FOR SEQ ID NO:682: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:682: 
AGACCAGATC 



(2) INFORMATION FOR SEQ ID NO:683: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 
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GACCAGATCT 



(2) INFORMATION FOR SEQ ID NO:684: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:684: 
ACCAGATCTG 



(2) INFORMATION FOR SEQ ID NO:685: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68 5: 
CCAGATCTGA 



(2) INFORMATION FOR SEQ ID NO:686: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:686: 

CAGATCTGAG 



(2) INFORMATION FOR SEQ ID NO:687: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:687: 

AGATCTGAGC 



(2) INFORMATION FOR SEQ ID NO:688: 



(ii) MOLECUl£ TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:683: 



0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ IDNO:688: 
GATCTGAGCC 



(2) INFORMATION FOR SEQ ID NO:689: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:689: 

ATCTGAGCCT 



(2) INFORMATION FOR SEQ ID NO:690: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single • 
(D) TOPOLOGY: linear 

(u) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDNO:690: 

TCTGAGCCTG 



(2) INFORMATION FOR SEQ ID NO:691: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 691: 
CTGAGCCTGG 



(2) INFORMATION FOR SEQ ID NO:692: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:692: 

TGAGCCTGGG 
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(2) INFORMATION FOR SEQ ID NO:693: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:693: 
GAGCCTGGGA 



(2) INFORMATION FOR SEQ ID NO:694: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:694: 

AGCCTGGGAG 



(2) INFORMATION FOR SEQ ID NO:695: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(x0 SEQUENCE DESCRIPTION: SEQ ID NO:695: 

GCCTGGGAGC 



(2) INFORMATION FOR SEQ ID NO:696: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

01) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:696: 
CCTGGGAGCT 



(2) INFORMATION FOR SEQ ID NO:697: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:697: 
CTGGGAGCTC 

(2) INFORMATION FOR SEQ ID NO:698: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
P) TOPOLOGY: linear 

GO MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:698: 

TGGGAGCTCT 

(2) INFORMATION FOR SEQ ID NO:699: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:699: 
GGGAGCTCTC 

(2) INFORMATION FOR SEQ ID NO:700: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xl) SEQUENCE DESCRIPTION: SEQ ID NO:700: 
GGAGCTCTCT 

(2) INFORMATION FOR SEQ ID NO:701: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:70I: 
GAGCTCTCTG 

(2) INFORMATION FOR SEQ ID NO:702: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:702: 
AGCTCTCTGG 

(2) INFORMATION FOR SEQ ID NO:703: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

09 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:703: 
GCTCTCTGGC 

(2) INFORMATION FOR SEQ ID NO:704: 

CO SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(H) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:704: 

CTCTCTGGCr 

(2) INFORMATION FOR SEQ ID NO:705: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:705: 
TCTCTGGCTA 

(2) INFORMATION FOR SEQ ID NO:706: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:706: 
CTCTGGCTAA 

(2) INFORMATION FOR SEQ ID NO:707: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDNO:707: 
TCTGGCTAAC 

(2) INFORMATION FOR SEQ ID NO:708: 

G) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDNO:708: 

CTGGCTAACT 

(2) INFORMATION FOR SEQ ID NO:709: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ IDNO:709: 
TGGCTAACTA 

(2) INFORMATION FOR SEQ ID NO:710: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
P) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDNO:710: 
GGCTAACTAG 

(2) INFORMATION FOR SEQ ID NO:71I: 

(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:711: 
GCTAACTAGG 

(2) INFORMATION FOR SEQ ID NO:712: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:712: 
CTAACTAGGG 

(2) INFORMATION FOR SEQ ID NO:7I3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:713: 
TAACTAGGGA 

(2) INFORMATION FOR SEQ ID NO:714: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ IDNO:714: 
AACTAGGGAA 

(2) INFORMATION FOR SEQ ID NO:715: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi*) SEQUENCE DESCRIPTION: SEQ ID NO:715: 
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ACTAGGGAAC 

(2) INFORMATION FOR SEQ ID NO:716: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(u) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDNO:716: 

CTAGGGAACC 

(2) INFORMATION FOR SEQ ID NO:7I7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

<ii> MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:717: 

TAGGGAACCC 

(2) INFORMATION FOR SEQ ID NO:7I8: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:718: 

AGGGAACCCA 

(2) INFORMATION FOR SEQ ID NO:7I9: 

0) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(x0 SEQUENCE DESCRIPTION: SEQ ID NO:719: 

GGGAACCCAC 

(2) INFORMATION FOR SEQ ID NO:720: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE* nucleic acid 



(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi*) SEQUENCE DESCRIPTION: SEQ ID NO:720: 

GGAACCCACT 

(2) INFORMATION FOR SEQ ID NO:721: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:72I: 

GAACCCACTC 

(2) INFORMATION FOR SEQ ID NO:722: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:722: 

AACCCACTGC 

(2) INFORMATION FOR SEQ ID NO:723: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:723: 

ACCCACTGCT 

(2) INFORMATION FOR SEQ ID NO:724: 

0) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:724: 

CCCACTGCIT 
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(2) INFORMATION FOR SEQ ID NO:725: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:725: 
CCACTGCTTA 

(2) INFORMATION FOR SEQ ID NO:726: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:726: 
CACTGCTTAA 

(2) INFORMATION FOR SEQ ID NO:727: 

(i) SEQUENCE CHARACTERISTICS: . , 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE* DNA 

(x0 SEQUENCE DESCRIPTION: SEQ ID NO:727: 

ACTGCTTAAG 

(2) INFORMATION FOR SEQ ID NO:728: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:728: 

CTGCTTAAGC 

(2) INFORMATION FOR SEQ ID NO:729: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(O STRANDEDNESS: single 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:729: 
TGCTTAAGCC 

(2) INFORMATION FOR SEQ ID NO:730: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:730: 

GCTTAAGCCT 

(2) INFORMATION FOR SEQ ID NO:731 : 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(O STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:73I: 

CTTAAGCCTC 

(2) INFORMATION FOR SEQ ID NO:732: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(U) MOLECULE TYPE: DNA 

(x0 SEQUENCE DESCRIPTION: SEQ ID NO:732: 

TTAAGCCTCA 

(2) INFORMATION FOR SEQ ID NO:733: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(9 STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(x0 SEQUENCE DESCRIPTION: SEQ ID NO:733: 

TAAGCCTCAA 

(2) INFORMATION FOR SEQ ID NO:734: 
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0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:734: 
AAGCCTCAAT 

(2) INFORMATION FOR SEQ ID NO:735: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE' nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:735: 

AGCCTCAATA 

(2) INFORMATION FOR SEQ ID NO:736: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic add 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDNO:736: 

GCCTCAATAA 

(2) INFORMATION FOR SEQ ID NO:737: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(H) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:737: 

CCTCAATAAA 

(2) INFORMATION FOR SEQ ID NO:738: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(u) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:738: 
CTCAATAAAG 

(2) INFORMATION FOR SEQ ID NO:739: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:739: 

TCAATAAAGC 

(2) INFORMATTON FOR SEQ ID NO:740: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:740: 

CAATAAAGCT 

(2) INFORMATION FOR SEQ ID NO:74I: 

0) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic add 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(U) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:74I: 

AATAAAGCTT 

(2) INFORMATTON FOR SEQ ID NO:742: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic add 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:742: 

ATAAAGCTTG 

(2) INFORMATION FOR SEQ ID NO:743: 
0) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:743: 

TAAAGCTTGC 



(2) INFORMATION FOR SEQ ID NO:744: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE* nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:744: 
AAAGCTTGCC 



(2) INFORMATION FOR SEQ ID NO:745: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:745: 
AAGCTTGCCT 



(2) INFORMATION FOR SEQ ID NO:746: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(U) MOLECULE TYPE" DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:746: 

AGCTTGCCTT 



(2) INFORMATION FOR SEQ ID NO:747: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:747: 
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GCTTGCCTTG 



(2) INFORMATION FOR SgQ ID NO:748: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:748: 
CTTGCCTTGA 



(2) INFORMATION FOR SEQ ID NO:749: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:749: 
TTGCCTTGAG 



(2) INFORMATION FOR SEQ ID NO:750: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ED NO:750: 

TGCCTTGAGT 



(2) INFORMATION FOR SEQ ID N0:7S1: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(u) MOLECULE TYPE: DNA 

(x0 SEQUENCE DESCRIPTION: SEQ ID NO:751: 

GCCTTGAGTG 



(2) INFORMATION FOR SEQ ID NO:752: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE nucleic acid 
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(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:752: 

CCTTGAGTGC 



(2) INFORMATION FOR SEQ ID NO:753: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TVPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDNO:753: 

CTTGAGTGCT 



(2) INFORMATION FOR SEQ ID NO:754: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDNO:754: 
TTGAGTGCTT 



(2) INFORMATION FOR SEQ ID NO:755: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:755: 
TGAGTGCTTC 



(2) INFORMATION FOR SEQ ID NO:756: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

09 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:756: 

GAGTGCTTCA 
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(2) INFORMATION FOR SEQ ID NO:757: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:757; 
AGTGCTTCAA 



(2) INFORMATION FOR SEQ ID NO:758: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:758: 
GTGCTTCAAG 



(2) INFORMATION FOR SEQ ID NO:759: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:759: 

TGCTTCAAGT 



(2) INFORMATION FOR SEQ ID NO:760: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(x0 SEQUENCE DESCRIPTION: SEQ ID NO:760: 
GCTTCAAGTA 



(2) INFORMATION FOR SEQ ID NO:761: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:761: 
CTTCAAGTAG 

(2) INFORMATION FOR SEQ ID N0.762: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:762: 
TTCAAGTAGT 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:766: 
AGTAGTGTGT 



(2) INFORMATION FOR SEQ ID NO:767: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(2) INFORMATION FOR SEQ ID NO:763: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(B) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:763: 
TCAAGTAGTC 

(2) INFORMATION FOR SEQ ID NO:764: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:764: 
CAAGTAGTGT 



(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:767: 
GTAGTGTCTG 

(2) INFORMATION FOR SEQ ID NO:768: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic arid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:768: 

TAGTGTGTCC 

(2) INFORMATION FOR SEQ ID NO:769: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(2) INFORMATION FOR SEQ ID NO:765: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:765: 

AAGTAGTGTG 



00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:769: 
AGTGTGTCCC 

(2) INFORMATION FOR SEQ ID NO;770: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(2) INFORMATION FOR SEQ ID NO:766: 



(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:770: 
GTGTGTGCCC 

(2) INFORMATION FOR SEQ ID NO:771 : 

CD SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic add 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(U) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDNO:77I: 

TGTGTGCCCG 

(2) INFORMATION FOR SEQ ID NO:772: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:772: 
GTGTGCCCGT 

(2) INFORMATION FOR SEQ ID NO:773: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:773: 

TGTGCCCGTC 

(2) INFORMATION FOR SEQ ID NO:774: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDNO:774: 

GTGCCCGTCT 

(2) INFORMATION FOR SEQ ID NO:775: 

(0 SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:775: 

TGCCCGTCTG 

(2) INFORMATION FOR SEQ ID NO:776: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
P) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:776: 
GCCCGTCTGT 

(2) INFORMATION FOR SEQ ID NO:777; 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic add 

(Q STRANDEDNESS: single 

(D) TOPOLOGY: linear . 

00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:777: 
CCCGTCTGTT 

(2) INFORMATION FOR SEQ ID NO:778: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:778: 
CCGTCTGTTG 

(2) INFORMATION FOR SEQ ID NO:779: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(x0 SEQUENCE DESCRIPTION: SEQ ID NO:779: 
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CGTCTGTTGT 

(2) INFORMATION FOR SEQ ID NO:780: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

Cii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:780: 
GTCTGTTGTG 

(2) INFORMATION FOR SEQ ID NO:78I: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ IDNO:78I: 
TCTGTTGTGT 

(2) INFORMATION FOR SEQ ID NO:782: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:782: 

CTGTTGTGTG 

(2) INFORMATION FOR SEQ ID NO:783: 

0) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:783: 
TGTTGTGTGA 

(2) INFORMATION FOR SEQ ID NO:784: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 



(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

fii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:784: 

GTTGTGTGAC 

(2) INFORMATION FOR SEQ ID NO:785: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:785: 

TTGTGTGACT 

(2) INFORMATION FOR SEQ ID NO:786: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:786: 

TGTGTGACTC 

(2) INFORMATION FOR SEQ ID NO:787: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(U) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:787: 

GTGTGACTCT 

(2) INFORMATION FOR SEQ ID NO:788: 

0) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic add 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(x0 SEQUENCE DESCRIPTION: SEQ ID NO:788: 

TGTGTGACTC 
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(2) INFORMATION FOR SEQ ID NO:789: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
<D) TOPOLOGY: linear 

09 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ DO NO:789: 

GTGTGACTCT 

(2) INFORMATION FOR SEQ ID NO:790: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

fii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:790: 
TGTGACTCTG 

(2) INFORMATION FOR SEQ ID NO:791: 

G) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xQ SEQUENCE DESCRIPTION: SEQ ID NO:791: 

GTGACTCTGG 

(2) INFORMATION FOR SEQ ID NO:792: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(u) MOLECULE TVPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:792: 
TGACTCTGGT 

(2) INFORMATION FOR SEQ ID NO:793: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:793: 
GACTCTGGTA 

(2) INFORMATION FOR SEQ ID NO:794: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

fii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:794: 

ACTCTGGTAA 

(2) INFORMATION FOR SEQ ID NO:795: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 795: 
CTCTGGTAAC 

(2) INFORMATION FOR SEQ ID NO:796: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

fu) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:796: 
TCTGGTAACT 

(2) INFORMATION FOR SEQ ID NO:797: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:797: 
CTGGTAACTA 
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(2) INFORMATION FOR SEQ ID NO:798: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base paiis 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:798: 
TGGTAACTAG 



(2) INFORMATION FOR SEQ ID NO:799: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(H) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:799: 
GGTAACTAGA 
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(2) INFORMATION FOR SEQ ID NO:800: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9207 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 800: 

TGGAAGGGCT AATTCACTCA CGGAAAAGAC CAGTTGAACC AGAAGAAGAT AGAAGAGGCC 60 

ATGAAGAAGA AAACAACAGA TTGTTCTGCT TGCTCAGCTG GGGACTTTCC AGAAGGCGCG 120 

GCCTGAGTGA CTAAGCCCCG TTGGGGACTT TCCGAAGAGG CATGAAGGGA CTTTCCAAGG 180 

CAGGCGTGGC CTGGGCGGGA CTGGGGAGTG GCGAGCCCTC AGATGCTGCA TATAAGCAGC 240 

TGCTTTCTGC CTGTACTGGG TCTCTCTGGT TAGACCAGAT CTGAGCCTGG GAGCTCTCTG 300 

GCTAGCTAGG GAACCCACTG CTTAAGCCTC AATAAAGCTT GCCTTGAGTG CTTCAAGTAG 360 

TGTGTGCCCG TCTGTTGTGT GACTCTGGTA TCTAGAGATC CCTCAGACCA TTTTAGTCCG 420 

TGTGGAAAAT CTCTAGCAGT GGCGCCCGAA CAGGGACTTG AAAGCGAAAG GAAAACCAGA 480 

GGAGCTCTCT CGACGCAGGA CTCGGCTTGC TGAAGCGCGC ACGGCAAGAG GCGAGGGGCG 540 

GCGACTGGTG AGTACGCCGA AAATTTTGAC TAGCGGAGGC TAGAAGGAGA GAGATGGGTG 600 

CGAGAGCGTC AATATTAAGC GGGGGAAAAT TAGATAGATG GGAGAAAATT CGGTTAAGGC 660 

CAGGAGGAAA GAAAAAGTAT AAATTAAAAC ATATAGTATG GGCAAGCAGG GAGCTAGAAC 720 

GATTCGCAGT CAATCCTGGC CTGTTGGAAA CATCAGAAGG CTGTAGACAA ATACTGGGAC 780 

AGTTACACCC GTCCCTTCAG ACAGGATCAG AAGAACTTAA ATCAGTATAT AATGCAGTAG 840 

CAGTCCTCTA TTGTGTGCAT CAAAACATAG ACATAAAGGA CACCAAGGAA GCTTTAGAAA 900 

AGATAGAGGA AGAGCAAAAC AAATGTAAGA AAAAAGCACA GCAAGCAGCA GCACAGCAAG 960 

CAGCAGCTGG CACAGGAAAC AGCAACCCGG TCAGCCAAAA TTACCCTATA GTACAGAACA 1020 

TGCAGGGGCA AATGGTACAT CAGGCCATAT CACCTAGAAC TTTAAATGCA TGGGTAAAAG 1080 

TAATAGAAGA GAAGGCTTTC AGCCCAGAGG TAATACCCAT GTTTTCAGCA TTATCAGAAG 1140 

GAGCCACCCC ACAAGATTTA AACACCATGC TAAACACAGT GGGGGGACAT CAAGCAGCTA 1200 

TGCAAATGTT AAAAGAGACC ATCAATGAGG AAGCTGCAGA ATGGGATAGA TTACATCCAG 1260 

CGCAGGCAGG GCCTGTTGCA CCAGGCCAGA TGAGAGACCC AAGGGGAAGT GACATAGCAG 1320 

GAACTACTAG TACCCTTCAG GAACAAATAG GATGGATGAC AGGTAATCCA GCTATCCCAG 1380 

TAGGAGAAAT CTATAAAAGA TGGATAATCC TGGGATTAAA TAAAATAGTA AGGATGTATA 1440 

GCCCTATCAG CATTCTGGAC ATAAAACAAG GACCAAAGGA ACCCTTTAGA GACTATGTAG 1500 

ACCGGTTCTA TAAAACTCTA AGAGCCGAGC AAGCTACACA GGAGGTAAAA AATTGGATGA 1560 

CAGAAACCTT GTTGGTCCAA AATGCAAACC CAGATTGTAA GACTATTTTA AAAGCATTGG 1620 
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GACCAGCAGC TACACTAGAA GAAATGATGA CAGCATGTCA GGGAGTGGGA GGACCCAGCC 1680 

ATAAAGCAAG AGTTTTGGCA GAAGCAATGA GCCAAGCAAC AAATGCAGCT ACTGTAATGA 1740 

TGCAGAGAAG CAATTTTAGA AACCAAAGAA AGAATGTTAA GTGTTTCAAT TGTGGCAAAG . 1800 

AAGGGCACAT AGCCAGAAAT TGCAGGGCTC CTAGGAAAAG GGGCTGTTGG AAATGTGGAA 1860 

AGGAAGGACA CCAAATGAAA GATTGTACTG AGAGACAGGC TAATTTTTTA GGGAAAATCT 1920 

GGCCTTCCCA CAAGGGGAGG CCAGGGAACT TTCTTCAGAG CAGGCCAGAA CCAACAGCCC 1980 

CTCTCCAGGG CAGGCCGGAG CCATCAGCCC CGCCAGAAGA GAGCTTCAGG TTTGGGGAGG 2040 

AGACAACAAC TCCCTCTCAG AAGCAGGAGC CGATAGACAG GGACAGGGAT CTGTATCCTT 2100 

TAGCTTCCCT CAGATCACTC TTTGGCAACG ACCCCTCGTC ACAATAAAGA TAGGGGGGCA 2160 

GCTGAAGGAA GCTCTATTAG ATACAGGAGC AGATGATACA GTATTAGAAG ACATGCATTT 2220 

GCCAGGAAAA TGGAAACCAA AAATGATAGG GGGAATTGGA GGTTTTATCA AAGTAAAACA 2280 

ATATGATGAA ATTCTTGTAG AAATCTGTGG ACATAAAGCT ATAGGTACAG TATTAGTAGG 2340 

ACCTACACCT GTCAACATAA TTGGAAGAAA TCTGTTGACT CAGATTGGTT GCACTTTAAA 2400 

TTTTCCCATT AGTCCTATTG AAACTGTACC AGTACAATTA AAGCCAGGAA TGGATGGCCC 2460 

AAAGGTTAAA CAATGGCCAT TGACAGAAGA GAAAATAAAA GCATTAGTAG AAATTTGTAC ; 2520 

AGAAATGGAA AAGGAAGGAA AGATTTCAAA AATTGGGCCT GAAAATCCAT ACAATACTCC 2580 

AGTATTTGCC ATAAAGAAAA AAGATGGTAC TAAATGGAGA AAATTAGTAG ATTTCAGAGA 2640 

CCTTAATAAG AGAACTCAAG ACTTCTGGGA AGTTCAATTA GGAATACCAC ATCCCTCAGG 2700 

ATTAAAAAAG AAAAAATCAG TAACAGTACT GGATGTGGGT GATGCATACT TTTCAGTTCC 2760 

CTTAGATGAA AACTTCAGGA AGTATACTGC ATTTACCATA CCTAGTATAA ATAATGAGAC 2820 

ACCAGGGATT AGATATCAGT ACAATGTGCT TCCACAGGGA TGGAAAGGAT CACCAGCAAT 2880 

ATTCCAAAGT AGCATGACAA GAATCTTAGA GCCTTTTAGA AGACAAAATC CAGACATAGT 2940 

TATCTATCAA TACATGGATG ACTTGTATGT AGGATCTGAT TTAGAAATAG GACAGCATAG 3000 

AATAAAAATA GAGGAACTGA GACAACATCT GTTGAAGTGG GGATTTACCA CACCAGACAA 3060 

AAAGCATCAG AAAGAACCCC CATTCCTTTG GATGGGTTAT GAACTCCATC CTGATAAATG 3120 

GACAGTGCAA CCTATAGTAC TGCCAGAAAA AGACAGCTGG ACTGTCAATG ACATACAGAA 3180 

GTTAGTGGGT AAATTAAATT GGGCAAGTCA GATTTACCCA GGAATTAAAG TAAGGCAATT 3240 

ATGTAAACTC CTTAGGGGAA CCAAAGCACT AACAGAAGTA ATACCACTAA CAGAAGAAGC 3300 

AGAGCTAGAA CTGGCAGAAA ACAGGGAAAT TCTAAGAGAA CCAGTACATG GAGTGTATTA 3360 

TGACCCATCA AAAGACTTAA TAGCAGAAAT ACAGAAGCAG GAGCAAGGCC AATGGACATA 3420 

TCAAATTTAT CAAGATCAAT TTAAAAATCT AAAAACAGGA AAGTATGCAA GATTGAGGGG 3480 

TGCCCACACT AATGATGTAA AACAATTTCC AGAGGCAGTG CAAAAAATAG CCACAGAAAG 3540 

CATAGTAATA TGGGGAAAGA CTCCTAAATT TAGACTACCC ATACAAAAAG AAACATGGGA 3600 

CGCATGGTGG ACAGAGTATT GGCAAGCCAC CTGGATTCCT GAGTGGGAGT TTGTCAATAC 3660 
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CCCTCCCCTA GTAAAATTAT GGTACCAGTT AGAAAAAGAA CCCATAATAG GAGCAGAAAC 3720 

TTTCTATGTA GATGGGGCAG CTAACAGAGA GACTAAATTA GGAAAAGCAG GATATGTTAC 3780 

TGACAGAGGA AGACAAAAAG TTGTCTCCCT AACTGACACA ACAAATCAGA AGACTGAGTT 3840 

ACAAGCAATT CATCTAGCTT TGCAGGATTC AGGATTAGAA GTAAACATAG TAACAGACTC 3900 

ACAGTATGCA TTAGGAATCA TTCAAGCACA ACCAGATAAA AGTGAATCAG AAATAGTCAA 3960 

TCAAATAATA GAGCAATTAA TAAAAAAGGA AAAGGTCTAC CTGGCATGGG TACCAGCACA 4020 

CAAAGGAATT GGAGGGAATG AACAAGTAGA TAAATTAGTC AGTGCTGGAA TCAGGAAAAT 4080 

ACTATTTTTA GATGGAATAG ATAAGGCACA AGAAGGCCAT GAGAAATATC ACAGTAATTG 4140 

GAGAGCAATG GCTAGTGGTT TTAACCTGCC ACCTATAGTA GCAAAAGAAA TAGTAGCCAG 4200 

CTGTGATAAA TGTCAGCTAA AAGGAGAAGC CATGCATGGA CAAGTAGACT GTAGTCCAGG 4260 

AATATGGCAA CTAGATTGTA CACATCTAGA AGGAAAAATT ATCCTGGTAG CAGTTCATGT 4320 

AGCCAGTGGA TATATAGAAG CAGAAGTTAT TCCAGCAGAG ACAGGGCAGG AAACAGCATA 4380 

CTTTATCTTA AAATTAGCAG GAAGGTGGCC AGTAAACACA ATACATACAG ACAATGGCGG 4440 

CAATTTCATC AGTACCACGG TTAAGGCCGC CTGTTGGTGG GCAGGGATCA AGCAGGAATT 4500 

TGGCATTCCC TACAATCCCC AAAGCCAAGG AGTAGTGGAA TCTATGAATA GAGAATTAAA 4560 

GAAAATTATA GGACAGGTAA GAGATCAGGC TGAACATCTT AAGACAGCAG TACAAATGGC 4620 

AGTATTCATC CACAATTTTA AAAGAAAAGG GGGGATTGGG GGATACAGTG CAGGGGAAAG 4680 

AATAGTAGAC ATAATAGCAA CAGACATACA AACTAAAGAA TTACAAAAGC AAATTACAAA 4740 

AATTCAAAAT TTTCGGGTTT ATTACAGGGA CAGCAGAGAT CCACTTTGGA AAGGACCAGC 4800 

AAAACTTCTC TGGAAAGGCG AAGGGGCAGT AGTAATACAA GATAATAGTG ACATAAAAGT 4860 

AGTGCCAAGA AGAAAAGTAA AGATCATTAG GGATTATGGA AAACAGATGG CAGGTGATGA 4920 

TTGTGTGGCA AGTAGACAGG ATGAGGATTA GAACATGGAA CAGTTTAGTG AAACACCATA 4980 

TGTATGTTTC AAAGAAAGCT AAGGGATGGA TTTATAGACA TCACTATGAA AACACTCATC 5040 

CAAAAATAAG CTCAGAAGTA CACATCCCAC TAGGGGAAGC TAGATTGGTA ATAACAACAT 5100 

ATTGGGGTCT ACATACAGGA GAAAGAGACT GGCATTTGGG TCAGGGAGTC TCCATAGAAT 5160 

GGAGGGAAAG GACATATAGA ACACAAGTAG ACCCCGAACT AGCAGACCAA CTAATTCATA 5220 

TACATTACTT TGATTGTTTT TCAGAATCTG CCATAAGAAG TGCCATATTA GGATATAGAG 5280 

TTAGGCATAG GTGTGAATAT CAAGCAGGAC ATAACAAGGT AGGATCTCTA CAGTACTTGG 5340 

CACTAACAGC ATTAATAACA CCAAAGAAGA TAAAGCCACC TTTGCCTAGT GTTGCGAAAC 5400 

TGACAGAGGA TAGATGGAAC AAGCCCCAGA AGACCAAGGG CCACAGAGGC AGCCATACAA 5460 

TGAATGGACA CTAGAACTTT TAGAGGAGCT TAAGAATGAA GCTGTTAGGC ATTTTCCTAG 5520 

GGTATGGCTC CATGGCTTAG GGCAACATAT CTATGAAACT TATGGGGATA CTTGGGAAGG 5580 

AGTGGAGGCC ATAACAAGAA CTCTGCAACA ACTGCTGTTT ATTCATTTCA GAATTGGGTG 5640 

TCAACATAGC AGAATAGGCA TTATTCGACA GAGGAGAGCA AGAAATGGAG CCAGTAGATC 5700 
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CTAGACTAGA GCCCTGGAAG CATCCAGGAA GTCAGCCTAA GACTGCGTGT ACCACTTGCT 5760 

ATTGTAAAAA GTGCTGCTTT CATTGCCAAG TTTGTTTTAT GACAAAAGGC TTAGGCATCT 5820 

CCTATGGCAG GAAGAAGCGG AGACAGCGAC GAAGAGCTCC TCAAGACAGT CAGACTCATC 5880. 

AAGCTTATCT ATCAAAGCAG TAAGTAATAT ATGTAATGCA ACCTTTACAA ATAGTAGCAA 5940 

TAGTAGCATT AGTAGTAGCA GGAATAATAG CAATAGTTGT GTGGACCATA GTATTCATAG 6000 

AATATAAGAA AATATTAAGA CAAAGAAAAA TAGACAGGTT GATTGATAGA ATAAGAGAAA 6060 

GAGCAGAAGA CAGTGGCAAT GACAGTGAAG GGGATCAGGA AGAATTATCG GCACTTGTGG 6120 

ACATGGGGGA CCATGATCCT TGGGATATTA ATGATCTGTA GAGCTGCAAA CAATTTGTGG 6180 

GTCACAGTCT ATTATGGGGT ACCTGTGTGG AGAGAAGCAA CCACCACTCT ATTTTGTGCA 6240 

TCAGATGCCA AGGCATATGA TGCAGAGGTA CATAATGTTT GGGCCACACA TGCCTGTGTA 6300 

CCCACAGACC CTAACCCACA AGAAGTAGAA TTGAAAAATG TGACAGAAAA TTTTAACATG 6360 

TGGAAAAATA ACATGGTAGA ACAGATGCAT GAGGATATAA TCAGTTTATG GGATCAAAGC 6420 

CTGAAGCCAT GTGTAAAATT AACCCCACTC TGTGTTTCTT TAAATTGCAC TGATGCTACT 6480 

AATACCACTA ATAGTAATAC CACTAGCAGC AGCGAGAAAC CGAAGGGGAC AGGGGAAATA 6540 

AAAAACTGCT CTTTCAATAT CACCACAAGC ATAAGAGATA AGGTGCAGAA ACAATATGCA 6600 

CTTTTTTATA GCCTTGATGT AGTACCAATG GATGATAATG ATAATAGTAC AAGCTATAGG ,6660 

TTAATAAGTT GTAACACCTC AATCATTACA CAGGCCTGTC CAAAGATATC CTTTGAGCCA 6720 

ATTCCCATAC ATTATTGTGC CCCGGCTGGT TTTGCGATTC TAAAGTGTAA AGATAAAAGG 6780 

TTCAATGGAA AAGGACCATG TACAAGTGTC AGCACAGTAC AGTGTACACA TGGAATTAGG 6840 

CCAGTAGTAT CAACTCAACT GTTGTTAAAT GGCAGTCTAG CAGAAGAAGA GGTAGTAATT 6900 

AGATCTGACA ATTTTACGAA CAATGCTAAA ACCATAATAG TACAGCTGAG CAAATCTGTA 6960 

GAAATTACTT GTGTAAGACC CAACAACAAT ACAAGAAAAA GTATAAGTAT GGGACCAGGG 7020 

AGAGCATTTT ATACAACAGA AATAATAGGA GATATAAGAC AAGCATATTG TAACATTAGT 7080 

AAAGCAAACT GGACTGACAC TTTAGAACAG ATAGCTAGAA AATTAAGAGA ACAATTTGAG 7140 

AATAAAACAA TAGTCTTTAA GCCATCCTCA GGAGGGGACC CAGAAATTGT AACACAGTTT 7200 

TACAGTTTTA ATTGTGGAGG GGAATTTTTC TACTGTAATT CAACACAACT GTTTAATGGT 7260 

ACTTGGAATG GTACTTGGGT TAATGGTACT TGGAGTAGTA ATAATACGAC TGATACTGCA 7320 

AATATCACAC TCCCATGCAG AATAAAACAA TTTATAAACA TGTGGCAGGA AGTAGGAAAA 7380 

GCAATGTATG CCCCTCCCAT CAAAGGACAA ATTAAATGTA CATCAAATAT TACAGGGCTG 7440 

ATATTAACAA GAGATGGTGG TAACAATAAC ACCACGAACG ACAACGAGAC CGAGACCTTC 7500 

AGACCTGGAG GAGGAGATAT GAGGGACAAT TGGAGAAGTG AATTATATAA ATATAAAGTA 7560 

GTACAAGTTG AACCATTAGG AGTAGCACCC ACCAAGGCAA AGAGAAGAGT GGTGCAAAGA 7620 

GAAAAAAGAG CAGTGGGAAT AGGAGCTATG TTCCTTGGGT TCTTAGGAGC AGCAGGAAGC 7680 

ACTATGGGCG CAGCGTCAGT GACGCTGACG GTACAAGCCA GACAATTATT GTCTGGTATA 7740 
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GTGCAGCAGC AGAACAATCT GCTGAGGGCT ATTGAGGCGC AACAGCATCT GTTGCAACTC 7800 

ACAGTCTGGG GCATCAAACA GCTCCAGGCA AGAGTCCTGG CTGTGGAAAG ATACCTAAGG 7860 

GATCAACAGC TCCTGGGACT TTGGGGTTGC TCTGGAAAAC TCATTTGCAC CACTACTGTG 7920 

CCTTGGAACA ATAGCTGGAG TAATAAATCT CTGGAAACAA TTTGGGATAA CATGACCTGG 7980 

ATGCAGTGGG AAAGAGAAAT TGACAATTAC ACAAACATAA TATACACCTT AATTGAAGAA 8040 

TCGCAGAACC AACAAGAAAA AAATGAACTA GAATTATTGG AATTGGATAA ATGGGCAAAT 8100 

TTGTGGAATT GGTTTAGTAT ATCAAACTGG CTATGGTATA TAAAATTATT CATAATGGTA 8160 

GTAGGAGGCT TGGTAGGTTT AAGAATAGTT TTTACTGTAC TTTCTATAGT TAATAGAGTT 8220 

AGGCAGGGAT ACTCACCATT ATCGTTTCAG ACCCACCTCC CAACCCCGAA GGGACCCGAC 8280 

AGGCCAGAAG GAATCGAAGA AGAAGGTGGA GAGAGAGACA GAGGCAGCTC CACTCGATTA 8340 

GTGCACGGAT TCTTAGCACT TTTCTGGGAC GACCTGAGGA GTCTGTGCCT CTTCAGCTAC 8400 

CACCACTTGA GAGACTTACT CTTGATTGTA ACGAGGATTG TGGAACTTCT GGGACGCAGG 8460 

GGATGGGAAG CCCTCAAATA CTGGTGGAAT CTCCTGCAGT ATTGGAGGCA GGAACTACAG 8520 

AAGAGTGCTG TTAGCTTGTT CAATGGCACG GCCATAGCAG TAGCTGAGGG GACAGATAGA 8580 

GTTATAGAAG CTTTACGAAG GGCTTATAGA GCTATTCTCC ACATACCTAG AAGAATAAGA 8640 

CAGGGCTTAG AAAGGGCTTT GCTATAAAAT GGGTGGCAAG TGGTCAGAAA GTAGTGTGGT 8700 

TAGAAGGCAT GTACCTTTAA GACAAGGCAG CTATAGATCT TAGCCGCTTT TTAAAAGAAA 8760 

AGGGGGGACT GGAAGGGCTA ATTCACTCAC GGAAAAGACC AGTTGAACCA GAAGAAGATA 8820 

GAAGAGGCCA TGAAGAAGAA AACAACAGAT TGTTCTGCTT GCTCAGCTGG GGACTTTCCA 8880 

GAAGGCGCGG CCTGAGTGAC TAAGCCCCGT TGGGGACTTT CCGAAGAGGC ATGAAGGGAC 8940 

TTTCCAAGGC AGGCGTGGCC TGGGCGGGAC TGGGGAGTGG CGAGCCCTCA GATGCTGCAT 9000 

ATAAGCAGCT GCTTTCTGCC TGTACTGGGT CTCTCTGGTT AGACCAGATC TGAGCCTGGG 9060 
AGCTCTCTGG CTAGCTAGGG AACCCACTGC TTAAGCCTCA ATAAAGCTTG CCTTGAGTGC • 9120 

TTCAAGTAGT GTGTGCCCGT CTGTTGTGTG ACTCTGGTAT CTAGAGATCC CTCAGACCAT 9180 

TTTAGTCCGT GTGGAAAATC TCTAGCA 9207 
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CLAIMS: 

1. An isolated HTV-1 strain or a component, part, fragment or derivative thereof 
which is substantially non-pathogenic. 

2. A strain of HTV-1 according to claim 1 wherein said strain is capable of 
stimulating an immune response in a human or primate against at least one glycoprotein 
on HIV-1 whilst not substantially reducing proliferative responses and cytokine 
production to a mitogen in said human or primate compared to a healthy, non-infected 
human or primate subject 

3. A strain of HIV-1 according to claim 2 which carries one or more mutations in 
its genome resulting in the inability to direct synthesis of at least one polypeptide or 
protein from a. pathogenic strain of HTV-1 or results in reduced synthesis of said 
polypeptide or protein. 

4. A strain of HTV-1 according to claim 2 which carries one or more mutations in 
its genome resulting in the said genome directing synthesis of a truncated form of a 
polypeptide or protein from a pathogenic strain of HTV-1. 

5. A strain of HTV-1 according to claim 3 or 4 wherein said strain carries a 
mutation in its nef gene and/or long terminal repeat (LTR) region or in a functionally 
equivalent location in the HIV-1 genome. 

6. A strain of HTV-1 according to claim 5 wherein said strain is reactive to 
antibodies to gp41-45, gpl20 and/or gpl60 of HTV-1 and carries a deletion of at least 
ten contiguous nucleotides in a region corresponding to the nefgene and/or LTR region 
of pathogenic HIV-1. 

7. A strain of HIV-1 according to claim 6 wherein said strain is capable of inducing 
an immune response to at least one of gag, pol and/or env. 
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8. A strain of HIV-1 according to claim 7 wherein said strain carries a deletion in 
its genome of at least 10 nucleotides within the region from nucleotide 8787 to 
nucleotide 9709 using the nucleotide numbering of HIV-1 strain NL4-3. 

9. A strain of HIV-1 according to claim 8 wherein said strain carries a deletion in 
its genome of at least 10 nucleotides from within a region selected from the list 
consisting of: 

nucleotide (i) 8830-8862; 

(ii) 9009-9035; 

(iii) 9019-9029; and 

(iv) 9033-9049. 

10. A strain of HIV-1 according to claim 8 wherein said strain carries a deletion in 
its genome of at least 10 nucleotides from within a region selected from the list 
consisting of: 

nucleotide (v) 9281-9371; 

(vi) 9281-9362; 

(vii) 9105-9224; and 

(viii) 9271-9370. 

11. A strain of HIV-1 according to claim 8 wherein said strain carries a deletion in 
its genome of at least 10 nucleotides from within a region selected from the list 
consisting of: 

nucleotide (ix) 8882-8928; 

(x) 8850-9006; 

(xi) 8792-9041; and 

(xii) 9112-9204. 



12. A strain of HIV-1 according to claim 8 wherein said strain carries a deletion in 
its genome of at least 10 nucleotides from within a region selected from the list 
consisting of: 
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nucleotide (xiii) 91 05-9224 

(xiv) 9389-9395; and 

(xv) 9281-9366. 

13. A strain of HTV-1 according to claim 8 wherein said deletion encompasses at 
least one of the following decanucleotides from the nef gene of HIV- 1 strain NL4-3: 



ATGGGTGGCA 


(SEQ 


ID NO:2) ; 


TGGGTGGCAA 


(SEQ 


ID NO 


:3) ; 


GGGTGGCAAG 


(SEQ 


ID N0:4) ; 


GGTGGCAAGT 


(SEQ 


ID NO 


:5) ; 


GTGGCAAGTG 


(SEQ 


ID NO: 6) ; 


TGGCAAGTGG 


(SEQ 


ID 


NO:7) 


GGCAAGTGGT 


(SEQ 


ID NO: 8) ; 


GCAAGTGGTC 


(SEQ ID NO: 9) ; 


CAAGTGGTCA 


(SEQ 


ID NO: 10) ; 


AAGTGGTCAA 


(SEQ 


ID 


NO: 11) 


AGTGGTCAAA 


(SEQ 


ID NO:12) ; 


GTGGTCAAAA 


(SEQ 


ID 


NO:13) 


TGGTCAAAAA 


(SEQ 


ID NO:14) ; 


GGTCAAAAAG 


(SEQ 


ID 


NO:15) 


GTCAAAAAGT 


(SEQ 


ID NO: 16) ; 


TCAAAAAGTA 


(SEQ 


ID 


NO: 17) 


CAAAAAGTAG 


(SEQ 


ID NO: 18) ; 


AAAAAGTAGT 


(SEQ ID NO 


:19) ; 


AAAAGTAGTG 


(SEQ 


ID NO:20) ; 


AAAGTAGTGT 


(SEQ 


ID 


NO: 21) 


AAGTAGTGTG 


(SEQ 


ID NO: 22) ; 


AGTAGTGTGA 


(SEQ 


ID 


NO:23) 


GTAGTGTGAT 


(SEQ 


ID NO:24) ; 


TAGTGTGATT 


(SEQ ID NO: 25) ; 


AGTGTGATTG 


(SEQ 


ID NO:26) ; 


GTGTGATTGG 


(SEQ 


ID 


NO: 27) 


TGTGATTGGA 


(SEQ 


ID NO: 28) ; 


GTGATTGGAT 


(SEQ 


ID 


NO:29) 


TGATTGGATG 


(SEQ 


ID NO: 30) ; 


GATTGGATGG 


(SEQ 


ID 


NO: 31) 


ATTGGATGGC 


(SEQ 


ID NO: 32) ; 


TTGGATGGCC 


(SEQ 


ID 


NO:33) 


TGGATGGCCT 


(SEQ 


ID NO:34) ; 


GGATGGCCTG 


(SEQ 


ID 


NO:35) 


GATGGCCTGC 


(SEQ 


ID NO: 36) ; 


ATGGCCTGCT 


(SEQ 


ID 


NO:37) 


TGGCCTGCTG 


(SEQ 


ID NO: 38) ; 


GGCCTGCTGT 


(SEQ 


ID 


NO:39) 


GCCTGCTGTA 


(SEQ 


ID NO: 40) ; 


CCTGCTGTAA 


(SEQ 


ID 


N0:41) 


CTGCTGTAAG 


(SEQ 


ID NO:42) ; 


TGCTGTAAGG 


(SEQ 


ID 


NO:43) 


GCTGTAAGGG 


(SEQ 


ID NO: 44) ; 


CTGTAAGGGA 


(SEQ 


ID 


NO:45) 


TGTAAGGGAA 


(SEQ 


ID NO: 46) ; 


GTAAGGGAAA 


(SEQ 


ID 


NO:47) 


TAAGGGAAAG 


(SEQ 


ID NO: 48) ; 


AAGGGAAAGA 


(SEQ 


ID 


NO:49) 


AGGGAAAGAA 


(SEQ 


ID NO: 50) ; 


GGGAAAGAAT 


(SEQ 


ID 


NO: 51) 


GGAAAGAATG 


(SEQ 


ID NO: 52) ; 


GAAAGAATGA 


(SEQ 


ID 


NO:53) 


AAAGAATGAG 


(SEQ 


ID NO: 54) ; 


AAGAATGAGA 


(SEQ 


ID 


NO:55) 


AGAATGAGAC 


(SEQ 


ID NO: 56) ; 


GAATGAGACG 


(SEQ 


ID 


NO: 57) 


AATGAGACGA 


(SEQ 


ID NO: 58) ; 


ATGAGACGAG 


(SEQ 


ID 


NO:59). 


TGAGACGAGC 


(SEQ 


ID NO: 60) ; 


GAGACGAGCT 


(SEQ 


ID 


NO: 61) 
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AGACGAGCTG 


(SEQ 


ID 


NO: 62) ; 


GACGAGCTGA 


(SEQ 


ID 


NO:63) 


ACGAGCTGAG 


(SEQ 


ID 


NO: 64) ; 


CGAGCTGAGC 


(SEQ 


ID 


NO:65) 


GAGCTGAGCC 


(SEQ 


ID 


NO: 66) ; 


AGCTGAGCCA 


(SEQ 


ID 


NO:67) 


GCTGAGCCAG 


(SEQ 


ID 


NO: 68) ; 


CTGAGCCAGC 


(SEQ 


ID 


NO:69) 


TGAGCCAGCA 


(SEQ 


ID 


NO: 70) ; 


GAGCCAGCAG 


(SEQ 


ID 


NO:71) 


AGCCAGCAGC 


(SEQ 


ID 


NO: 72) ; 


GCCAGCAGCA 


(SEQ 


ID 


NO:73) 


CCAGCAGCAG 


(SEQ 


ID 


NO: 74) ; 


CAGCAGCAGA 


(SEQ 


ID 


NO: 75) 


AGCAGCAGAT 


(SEQ 


ID 


NO:76) ; 


GCAGCAGATG 


(SEQ 


ID 


NO: 77) 


CAGCAGATGG 


(SEQ 


ID 


NO: 78) ; 


AGCAGATGGG 


(SEQ 


ID 


NO:79) 


GCAGATGGGG 


(SEQ 


ID 


NO: 80) ; 


CAGATGGGGT 


(SEQ 


ID 


NO:81) 


AGATGGGGTG 


(SEQ 


ID 


NO:82) ; 


GATGGGGTGG 


(SEQ 


ID 


NO: 83) 


ATGGGGTGGG 


(SEQ 


ID 


NO: 84) ; 


TGGGGTGGGA 


(SEQ 


ID 


NO:85) 


GGGGTGGGAG 


(SEQ 


ID 


NO: 86) ; 


GGGTGGGAGC 


(SEQ 


ID 


NO: 87) 


GGTGGGAGCA 


(SEQ 


ID 


NO:88) ; 


GTGGGAGCAG 


(SEQ 


ID 


NO:89) 


TGGGAGCAGT 


(SEQ 


ID 


NO:90) ; 


GGGAGCAGTA 


(SEQ 


ID 


NO: 91) 


.GGAGCAGTAT 


(SEQ 


ID 


NO: 92) ; 


GAGCAGTATC 


(SEQ 


ID 


NO:93) 


AGGAGTATCT 


(SEQ 


ID 


NO:94) ; 


GCAGTATCTC 


(SEQ 


ID 


NO:95) 


CAGTATCTCG 


(SEQ 


ID 


NO: 96) ; 


AGTATCTCGA 


(SEQ 


ID 


NO: 97) 


GTATCTCGAG 


(SEQ 


ID 


NO: 98) ; 


TATCTCGAGA 


(SEQ 


ID 


NO:99) 


ATCTCGAGAC 


(SEQ 


ID NO: 100) 


? TCTCGAGACC 


(SEQ ID 


NO: 101) 


CTCGAGACCT 


(SEQ 


ID 


NO: 102) 


; TCGAGACCTA 


(SEQ ID 


NO 


:103) , 




CGAGACCTAG 


(SEQ 


ID 


NO: 104) 


; GAGACCTAGA 


(SEQ ID 


NO: 105) , 




AGACCTAGAA 


(SEQ 


ID 


NO: 106) 


; GACCTAGAAA 


(SEQ ID 


NO 


:107) , 




ACCTAGAAAA 


(SEQ 


ID 


NO: 108) 


; CCTAGAAAAA 


(SEQ ID 


NO 


:109) , 




CTAGAAAAAC 


(SEQ 


ID 


NO: 110) , 


r TAGAAAAACA 


(SEQ ID 


NO 


:111) < 




AGAAAAACAT 


(SEQ 


ID 


NO: 112) , 


r GAAAAACATG 


(SEQ ID 


NO 


:113) , 




AAAAACATGG 


(SEQ 


ID 


NO:114) , 


AAAACATGGA 


(SEQ ID 


NO 


:115) , 




AAACATGGAG 


(SEQ 


ID 


NO: 116) , 


AACATGGAGC 


(SEQ ID 


NO 


:117) , 




ACATGGAGCA 


(SEQ 


ID 


NO: 118) , 


CATGGAGCAA 


(SEQ ID 


NO 


:119) , 




ATGGAGCAAT 


(SEQ 


ID 


NO: 120) , 


TGGAGCAATC 


(SEQ ID 


NO 


:121) , 




GGAGCAATCA 


(SEQ 


ID 


NO:122) j 


GAGCAATCAC 


(SEQ ID 


NO 


:123) ; 




AGCAATCACA 


(SEQ 


ID 


NO:124) , 


GCAATCACAA 


(SEQ ID 


NO: 125) , 




CAATCACAAG 


(SEQ 


ID NO:126) , 


AATCACAAGT 


(SEQ ID 


NO 


:127) , 




ATCACAAGTA 


(SEQ 


ID 


NO: 128) , 


TCACAAGTAG 


(SEQ ID 


NO 


:129) , 




CACAAGTAGC 


(SEQ 


ID 


NO: 130) , 


ACAAGTAGCA 


(SEQ ID 


NO 


:131) t 




CAAGTAGCAA 


(SEQ 


ID 


NO: 13 2) , 


AAGTAGCAAT 


(SEQ ID 


NO 


:133) , 




AGTAGCAATA 


(SEQ 


ID 


NO: 134) , 


GTAGCAATAC 


(SEQ ID 


NO 


:135) , 




TAGCAATACA 


(SEQ 


ID 


NO: 136) ; 


AGCAATACAG 


(SEQ ID 


NO 


:137) , 
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GCAATACAGC (SEQ ID NO: 13 8) 
AATACAGCAG (SEQ ID NO: 140) 
TACAGCAGCT (SEQ ID NO: 142) 
CAGCAGCTAA (SEQ ID NO: 144) 
GCAGCTAACA (SEQ ID NO: 14 6) 
AGCTAACAAT (SEQ ID NO: 14 8) 
CTAACAATGC (SEQ ID NO: 150) 
AACAATGCTG (SEQ ID NO: 152) 
CAATGCTGCT (SEQ ID NO: 154) 
ATGCTGCTTG (SEQ ID NO: 156) 
GCTGCTTGTG (SEQ ID NO: 158) 
TGCTTGTGCC (SEQ ID NO: 160) 
CTTGTGCCTG (SEQ ID NO: 162) 
TGTGCCTGGC (SEQ ID NO: 164) 
TGCCTGGCTA (SEQ ID NO; 166) 
CCTGGCTAGA (SEQ ID NO: 168) 
TGGCTAGAAG (SEQ ID NO : 170 ) , 

GCTAGAAGCA (SEQ ID NO: 172) ; 

TAGAAGCACA (SEQ ID NO: 174) ; 

GAAGCACAAG (SEQ ID NO:176); 

AGCACAAGAG (SEQ ID NO: 178); 

CACAAGAGGA (SEQ ID NO:180)j 

CAAGAGGAGG (SEQ ID NO:182); 

AGAGGAGGAA (SEQ ID NO: 184); 

AGGAGGAAGA (SEQ ID NO: 186) ; 

GAGGAAGAGG (SEQ ID NO:188); 

GGAAGAGGTG (SEQ ID NO: 190); 

AAGAGGTGGG (SEQ ID NO: 192); 

GAGGTGGGTT (SEQ ID NO: 194); 

GGTGGGTTTT (SEQ ID NO: 196); 

TGGGTTTTCC (SEQ ID NO: 198); 

GGTTTTCCAG (SEQ ID NO: 2 00); 

TTTTCCAGTC (SEQ ID NO: 2 02) ; 

TTCCAGTCAC (SEQ ID NO: 2 04); 

CCAGTCACAC (SEQ ID NO:206); 

AGTCACACCT ( SEQ ID NO : 2 0 8 ) ; 

TCACACQTCA (SEQ ID NO:210) ; 
ACACCTCAGG (SEQ ID NO: 212) ; 



CAATACAGCA (SEQ ID NO: 139); 

ATACAGCAGC (SEQ ID NO: 141); 

ACAGCAGCTA (SEQ ID NO: 143); 

AGCAGCTAAC (SEQ ID NO: 145); 

CAGCTAACAA (SEQ ID NO: 147); 

GCTAACAATG (SEQ ID NO: 149); 

TAACAATGCT (SEQ ID NO: 151); 

ACAATGCTGC (SEQ ID NO: 153); 

AATGCTGCTT (SEQ ID NO: 155); 

TGCTGCTTGT (SEQ ID NO: 157); 

CTGCTTGTGC (SEQ ID NO: 159); 

GCTTGTGCCT (SEQ ID NO: 161); 

TTGTGCCTGG (SEQ ID NO: 163) ; 

GTGCCTGGCT (SEQ ID NO: 165) ; 

GCCTGGCTAG (SEQ ID NO:167) ; 

CTGGCTAGAA (SEQ ID NO: 169) ; 
GGCTAGAAGC (SEQ ID NO: 171) ; 
CTAGAAGCAC (SEQ ID NO: 173) ; 
AGAAGCACAA (SEQ ID NO:175) ; 
AAGCACAAGA (SEQ ID NO:177); 
GCACAAGAGG (SEQ ID NO:179) ; 
ACAAGAGGAG (SEQ ID NO: 181) ; 
AAGAGGAGGA (SEQ ID NO:183); 
GAGGAGGAAG (SEQ ID NO:185); 
GGAGGAAGAG (SEQ ID NO: 187) ; 
AGGAAGAGGT (SEQ ID NO:189) ; 
GAAGAGGTGG (SEQ ID NO:191); 
AGAGGTGGGT (SEQ ID NO:193); 
AGGTGGGTTT (SEQ ID NO:195); 
GTGGGTTTTC (SEQ ID NO:197); 
GGGTTTTCCA (SEQ ID NO: 199); 
GTTTTCCAGT (SEQ ID NO:201); 
TTTCCAGTCA (SEQ ID NO: 203) ; 
TCCAGTCACA (SEQ ID NO:205) ; 
CAGTCACACC (SEQ ID NO: 207) ; 
GTCACACCTC (SEQ ID NO: 209) ; 
CACACCTCAG (SEQ ID NO:211) ; 
CACCTCAGGT (SEQ ID NO: 213) ; 
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ACCTCAGGTA 


(SEQ 


ID 


NO 


:214) 


CTCAGGTACC 


(SEQ 


ID 


NO 


:216) 


CAGGTACCTT 


(SEQ 


ID 


NO 


:218) 


GGTACCTTTA 


(SEQ 


ID 


NO 


:220) 


TACCTTTAAG 


(SEQ 


ID 


NO 


:222) 


CCTTTAAGAC 


(SEQ 


ID 


NO 


:224) 


TTTAAGACCA 


(SEQ 


ID 


NO 


:226) 


TAAGACCAAT 


(SEQ 


ID 


NO: 


228) 


AGACCAATGA 


(SEQ 


ID 


NO: 


230) 


ACCAATGACT 


(SEQ 


ID 


NO: 


232) 


CAATGACTTA 


(SEQ 


ID 


NO: 


234) 


ATGACTTACA 


(SEQ 


ID 


NO: 


236) 


GACTTACAAG 


(SEQ 


ID 


NO: 


238) 


CTTACAAGGC 


(SEQ 


ID 


NO: 


240) 


TACAAGGCAG 


(SEQ 


ID 


NO: 


242) 


CAAGGCAGCT 


(SEQ 


ID 


NO: 


244) 


AGGCAGCTGT 


(SEQ 


ID 


NO: 


246) 


GCAGCTGTAG 


(SEQ 


ID 


NO: 


248) 


AGCTGTAGAT 


(SEQ 


ID 


NO: 


250) 


CTGTAGATCT 


(SEQ 


ID 


NO: 


252) 


GTAGATCTTA 


(SEQ 


ID 


NO: 


254) 


AGATCTTAGC 


(SEQ 


ID 


NO: 


256) 


ATCTTAGCCA 


(SEQ 


ID 


NO: 


258) 


CTTAGCCACT 


(SEQ 


ID 


NO: 


260) 


TAGCCACTTT 


(SEQ 


ID 


NO: 


262) 


GCCACTTTTT 


(SEQ 


ID 


NO: 


264) 


CACTTTTTAA 


(SEQ 


ID 


NO: 


266) 


CTTTTTAAAA 


(SEQ 


ID 


NO: 


268) 


TTTTAAAAGA 


(SEQ 


ID 


NO: 


270) 


TTAAAAGAAA 


(SEQ 


ID 


NO: 


272) 


AAAAGAAAAG 


(SEQ 


ID 


NO: 


274) 


AAGAAAAGGG 


(SEQ 


ID 


NO: 


276) 


GAAAAGGGGG 


(SEQ 


ID 


NO: 


278) 


AAAGGGGGGA 


(SEQ 


ID 


NO: 


280) 


AGGGGGGACT 


(SEQ 


ID 


NO: 


282) 


GGGGGACTGG 


(SEQ 


ID 


NO: 


284) 


GGGACTGGAA 


(SEQ 


ID 


NO: 


286) 


GACTGGAAGG 


(SEQ 


ID 


NO: 


288) 
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CCTCAGGTAC 


(SEQ 


ID 


NO 


:215) 


TCAGGTACCT 


(SEQ 


ID 


NO 


:217) 


AGGTACCTTT 


(SEQ 


ID 


NO 


:219) 


GTACCTTTAA 


(SEQ 


ID 


NO 


:221) 


ACCTTTAAGA 


(SEQ 


ID 


NO 


:223) 


CTTTAAGACC 


(SEQ 


ID 


NO 


:225) 


TTAAGACCAA 


(SEQ 


ID 


NO: 


227) 


AAGACCAATG 


(SEQ 


ID 


NO: 


229) 


GACCAATGAC 


(SEQ 


ID 


NO: 


231) 


CCAATGACTT 


(SEQ 


ID 


NO: 


233) 


AATGACTTAC 


(SEQ 


ID 


NO: 


235) 


TGACTTACAA 


(SEQ 


ID 


NO: 


237) 


ACTTACAAGG 


(SEQ 


ID 


NO: 


239) 


TTACAAGGCA 


(SEQ 


ID 


NO: 


241) 


ACAAGGCAGC 


(SEQ 


ID 


NO: 


243) 


AAGGCAGCTG 


(SEQ 


ID 


NO: 


245) 


GGCAGCTGTA 


(SEQ 


ID 


NO: 


247) 


CAGCTGTAGA 


(SEQ 


ID 


NO: 


249) 


GCTGTAGATC 


(SEQ 


ID 


NO: 


251) 


TGTAGATCTT 


(SEQ 


ID 


NO: 


253) 


TAGATCTTAG 


(SEQ 


ID 


NO: 


255) 


GATCTTAGCC 


(SEQ 


ID 


NO: 


257) 


TCTTAGCCAC 


(SEQ 


ID 


NO: 


259) 


TTAGCCACTT 


(SEQ 


ID 


NO: 


261) 


AGCCACTTTT 


(SEQ 


ID 


NO: 


263) 


CCACTTTTTA 


(SEQ 


ID 


NO: 


265) 


ACTTTTTAAA 


(SEQ 


ID 


NO: 


267) 


TTTTTAAAAG 


(SEQ 


ID 


NO: 


269) 


TTTAAAAGAA 


(SEQ 


ID 


NO: 


271) 


TAAAAGAAAA 


(SEQ 


ID 


NO: 


273) 


AAAGAAAAGG 


(SEQ 


ID 


NO: 


275) 


AGAAAAGGGG 


(SEQ 


ID 


NO: 


277) 


AAAAGGGGGG 


(SEQ 


ID 


NO: 


279) 


AAGGGGGGAC 


(SEQ 


ID 


NO: 


281) 


GGGGGGACTG 


(SEQ 


ID 


NO: 


283) 


GGGGACTGGA 


(SEQ 


ID 


NO: 


285) 


GGACTGGAAG 


(SEQ 


ID 


NO: 


287) 


ACTGGAAGGG 


(SEQ 


ID 


NO: 


289) 
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CTGGAAGGGC (SEQ ID NO: 2 90) 
GGAAGGGCTA (SEQ ID NO: 292) 
AAGGGCTAAT (SEQ ID NO: 2 94) 
GGGCTAATTC (SEQ ID NO: 296) 
GCTAATTCAC (SEQ ID NO: 298) 
TAATTCACTC (SEQ ID NO: 300) 
ATTCACTCCC (SEQ ID NO: 302) 
TCACTCCCAA (SEQ ID NO: 304) 
ACTCCCAAAG (SEQ ID NO: 306) 
TCCCAAAGAA (SEQ ID NO: 308) 
CCAAAGAAGA (SEQ ID NO: 310) 
AAAGAAGACA (SEQ ID NO: 312) 
AGAAGACAAG (SEQ ID NO: 3 14) 
AAGACAAGAT (SEQ ID NO: 316) 
GACAAGATAT (SEQ ID NO: 318) 
, CAAGATATCC (SEQ ID NO: 320) 
. AGATATCCTT (SEQ ID NO: 322) 
ATATCCTTGA (SEQ ID NO:324) 
■ ATCCTTGATC (SEQ ID NO:326) 
; CCTTGATCTG (SEQ ID NO: 328) 
TTGATCTGTG (SEQ ID NO:330) 
GATCTGTGGA (SEQ ID NO:332) 
TCTGTGGATC (SEQ ID NO: 334) 
TGTGGATCTA (SEQ ID NO:336) , 
TGGATCTACC (SEQ ID NO:338) , 
GATCTACCAC (SEQ ID 110:340); 
TCTACCACAC (SEQ ID NO:342)j 
TACCACACAC • (SEQ ID NO: 344) ; 
CCACACACAA (SEQ ID NO:346) ; 
ACACACAAGG (SEQ ID NO:348); 
ACACAAGGCT (SEQ ID NO:350); 
ACAAGGCTAC (SEQ ID NO: 3 52); 
AAGGCTACTT (SEQ ID NO: 3 54) j 
GGCTACTTCC (SEQ ID NO: 3 56); 
CTACTTCCCT ( SEQ ID NO : 3 5 8 ) ; 
ACTTCCCTGA (SEQ ID NO: 360) ; 
TTCCCTGATT (SEQ ID NO: 3 62); 
CCCTGATTGG (SEQ ID NO: 3 64); 
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TGGAAGGGCT ( SEQ ID NO : 2 9 1 ) ; 
GAAGGGCTAA (SEQ ID NO: 2 93); 
AGGGCTAATT (SEQ ID NO:295);. 
GGCTAATTCA (SEQ ID NO:297) ; 
CTAATTCACT (SEQ ID NO: 299); 
AATTCACTCC (SEQ ID NO: 3 01) ; 
TTCACTCCCA (SEQ ID NO:303); 
CACTCCCAAA (SEQ ID NO:305); 
CTCCCAAAGA (SEQ ID NO: 3 07) ; 
CCCAAAGAAG (SEQ ID NO: 3 09); 
CAAAGAAGAC (SEQ ID NO:311) ; 
AAGAAGACAA (SEQ ID NO:313) ; 
GAAGACAAGA (SEQ ID NO:315) ; 
AGACAAGATA (SEQ ID NO: 3 17) ; 
ACAAGATATC (SEQ ID NO:319); 
AAGATATCCT (SEQ ID NO: 321); 
GATATCCTTG (SEQ ID NO: 323) ; 
TATCCTTGAT (SEQ ID NO : 3 2 5 ) ; 
TCCTTGATCT (SEQ ID NO: 327) ; 
CTTGATCTGT (SEQ ID NO: 329) ; 
TGATCTGTGG (SEQ ID NO: 331); 
ATCTGTGGAT (SEQ ID NO: 333) ; 
CTGTGGATCT (SEQ ID NO:335) ; 
GTGGATCTAC (SEQ ID NO:337); 
GGATCTACCA (SEQ ID NO:339) ; 
ATCTACCACA (SEQ ID NO: 341) ; 
CTACCACACA (SEQ ID NO: 343) ; 
ACCACACACA (SEQ ID NO:345) ; 
CACACACAAG (SEQ ID NO:347) ; 
CACACAAGGC (SEQ ID NO:349) ; 
CACAAGGCTA (SEQ ID NO:351) ; 
CAAGGCTACT (SEQ ID NO: 3 53) ; 
AGGCTACTTC (SEQ ID NO:355) ; 
GCTACTTCCC (SEQ ID NO: 357) ; 
TACTTCCCTG ( SEQ ID NO : 3 5 9 ) ; 
CTTCCCTGAT (SEQ ID NO: 361) ; 
TCCCTGATTG (SEQ ID NO: 363) ; 
CCTGATTGGC (SEQ ID NO: 3 65) ; 
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CTGATTGGCA (SEQ ID NO: 366) 
GATTGGCAGA (SEQ ID NO: 368) 
TTGGCAGAAC (SEQ ID NO: 370) 
GGCAGAACTA (SEQ ID NO: 3 72) 
CAGAACTACA (SEQ ID NO: 3 74) 
GAACTACACA (SEQ ID NO: 3 76) 
ACTACACACC (SEQ ID NO: 378) 
TACACACCAG (SEQ ID NO: 380) 
CACACCAGGG (SEQ ID NO: 3 82) 
CACCAGGGCC (SEQ ID NO: 3 84) 
CCAGGGCCAG (SEQ ID NO: 386) 
AGGGCCAGGG (SEQ ID NO:388) 
GGCCAGGGGT (SEQ ID NO:390) 
CCAGGGGTCA (SEQ ID NO: 392) 
AGGGGTCAGA (SEQ ID NO: 3 94) 
GGGTCAGATA (SEQ ID NO:396) 
GTCAGATATC (SEQ ID NO: 398) 
CAGATATCCA (SEQ ID NO: 400) 
GATATCCACT (SEQ ID NO:402) J 
TATCCACTGA (SEQ ID NO: 404) ; 
TCCACTGACC (SEQ ID NO:406)j 
CACTGACCTT (SEQ ID NO:408) ; 
CTGACCTTTG (SEQ ID NO: 410) ; 
GACCTTTGGA (SEQ ID NO: 4 12) ; 
CCTTTGGATG (SEQ ID NO: 414) ; 

TTTGGATGGT (SEQ ID NO:416) ; 

TGGATGGTGC (SEQ ID NO: 418) ; 

GATGGTGCTA • (SEQ ID NO:420); 

TGGTGCTACA (SEQ ID NO: 422) ; 

GTGCTACAAG (SEQ ID NO: 424) ; 

GCTACAAGCT (SEQ ID NO:426) ; 

TACAAGCTAG (SEQ ID NO:428) ; 

CAAGCTAGTA (SEQ ID NO:430) ; 

AGCTAGTACC (SEQ ID NO: 432) ; 

CTAGTACCAG (SEQ ID NO: 434) ; 

AGTACCAGTT (SEQ ID NO:436) ; 

TACCAGTTGA { SEQ ID NO : 4 3 8 ) ; 

CCAGTTGAGC (SEQ ID NO: 440) ; 
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TGATTGGCAG (SEQ ID NO: 367) 
ATTGGCAGAA (SEQ ID NO: 369) 
TGGCAGAACT (SEQ ID NO: 371) 
GCAGAACTAC (SEQ ID NO: 3 73) 
AGAACTACAC (SEQ ID NO: 375) 
AACTACACAC (SEQ ID NO: 377) 
CTACACACCA (SEQ ID NO:379) 
ACACACCAGG (SEQ ID NO: 381) 
ACACCAGGGC (SEQ ID NO: 383) 
ACCAGGGCCA (SEQ ID NO:385) 
CAGGGCCAGG (SEQ ID NO: 387) 
GGGCCAGGGG (SEQ ID NO: 389) 
GCCAGGGGTC (SEQ- ID NO: 3 91) 
CAGGGGTCAG (SEQ ID NO: 3 93) 
GGGGTCAGAT (SEQ ID NO: 3 95) 
GGTCAGATAT (SEQ ID NO:397) 
TCAGATATCC (SEQ ID NO: 3 99) . 

AGATATCCAC (SEQ ID NO:401) , 

ATATCCACTG (SEQ ID NO:403) l 

ATCCACTGAC (SEQ ID NO: 405) , 
CCACTGACCT (SEQ ID NO: 4 07) ; 

ACTGACCTTT (SEQ ID NO: 4 09) ; 

TGACCTTTGG (SEQ ID NO: 4 11) ; 

ACCTTTGGAT (SEQ ID NO: 413) j 

CTTTGGATGG (SEQ ID NO: 4 15) \ 

TTGGATGGTG (SEQ ID NO:417) ; 

GGATGGTGCT (SEQ ID NO:419) ; 

ATGGTGCTAC (SEQ ID NO: 421) ; 

GGTGCTACAA (SEQ ID NO: 423) ; 

TGCTACAAGC (SEQ ID NO: 425) ; 

CTACAAGCTA (SEQ ID NO:427) ; 

ACAAGCTAGT (SEQ ID NO:429) ; 

AAGCTAGTAC (SEQ ID NO:431); 

GCTAGTACCA (SEQ ID NO:433) ; 

TAGTACCAGT (SEQ ID NO: 43 5) ; 

GTACCAGTTG (SEQ ID NO: 437) ; 

ACCAGTTGAG (SEQ ID NO: 43 9) ; 

CAGTTGAGCC ( SEQ ID NO : 4 4 1 ) ; 
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AGTTGAGCCA (SEQ ID NO: 442); 
TTGAGCCAGA (SEQ ID NO: 444) ; 
GAGCCAGATA (SEQ ID NO: 44 6) ; 
GCCAGATAAG ( SEQ ID NO : 44 8 ) ; 
CAGATAAGGT (SEQ ID NO:450) ; 
GATAAGGTAG (SEQ ID NO:452) ; 
TAAGGTAGAA (SEQ ID NO: 4 54); 
AGGTAGAAGA (SEQ ID NO:456) ; 
GTAGAAGAGG (SEQ ID NO:458) ; 
AGAAGAGGCC (SEQ ID NO:460) ; 
AAGAGGCCAA (SEQ ID NO:4€2) ; 
GAGGCCAATA (SEQ ID NO: 464) ; 
GGCCAATAAA (SEQ ID NO:466) ; 
CCAATAAAGG (SEQ ID NO:468) ; 
AATAAAGGAG (SEQ ID NO: 470) ; 
TAAAGGAG AG (SEQ ID NO:472) ; 
AAGGAGAGAA (SEQ ID NO: 4 74) ; 
GGAGAGAACA (SEQ ID NO: 476) ; 
AGAGAACACC (SEQ ID NO: 4 78); 
AGAAGACCAG (SEQ ID NO:480) ; 
AACACCAGCT (SEQ ID NO: 4 82) ; 
CACCAGCTTG (SEQ ID NO: 4 84) ; 
CCAGCTTGTT (SEQ ID NO:486) ; 
AGCTTGTTAC (SEQ ID NO: 4 88); 
CTTGTTACAC (SEQ ID NO:490) ; 
TGTTACACCC (SEQ ID NO: 4 92); 
TTACACCCTG (SEQ ID NO: 4 94); 
ACACCCTGTG (SEQ ID NO:496) ; 
ACCCTGTGAG ( SEQ ID NO : 4 9 8 ) ; 
CCTGTGAGCC (SEQ ID NO: 500) ; 
TGTGAGCCTG (SEQ ID NO: 5 02); 
TGAGCCTGCA (SEQ ID NO: 5 04); 
AGCCTGCATG (SEQ ID NO: 506) ; 
CCTGCATGGA (SEQ ID NO: 5 08); 
TGCATGGAAT (SEQ ID NO: 510); 
CATGGAATGG (SEQ ID NO:512); 
TGGAATGGAT (SEQ ID NO:514); 
GAATGGATGA (SEQ ID NO:516); 
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GTTGAGCCAG 


(SEQ 


ID 


NO 


:443) 


TGAGCCAGAT 


(SEQ 


ID 


NO 


:445) 


AGCCAGATAA 


(SEQ 


ID 


NO 


:447) 


CCAGATAAGG 


(SEQ 


ID 


NO 


:449) 


AGATAAGGTA 


(SEQ 


ID 


NO 


:451) 


ATAAGGTAGA 


(SEQ 


ID 


NO 


:453) 


AAGGTAGAAG 


(SEQ 


ID 


NO 


:455) 


GGTAGAAGAG 


(SEQ 


ID 


NO 


:457) 


TAGAAGAGGC 


(SEQ 


ID 


NO 


:459) 


GAAGAGGCCA 


(SEQ 


ID 


NO 


:461) 


AGAGGCCAAT 


(SEQ 


ID 


NO 


:463) 


AGGCCAATAA 


(SEQ 


ID 


NO 


:465) 


GCCAATAAAG 


(SEQ 


ID 


NO 


:467) 


CAATAAAGGA 


(SEQ 


ID 


NO 


:469) 


ATAAAGGAGA 


(SEQ 


ID 


NO 


:471) 


AAAGGAGAGA 


(SEQ 


ID 


NO. 


473) 


AGGAGAGAAC. 


(SEQ 


ID 


NO: 


475) 


GAGAGAACAC 


(SEQ 


ID 


NO: 


477) 


GAGAACACCA 


(SEQ 


ID 


NO: 


479) 


GAACACCAGC 


(SEQ 


ID 


NO: 


481) 


ACACCAGCTT 


(SEQ 


ID 


NO: 


483) 


ACCAGCTTGT 


(SEQ 


ID 


NO: 


485) 


CAGCTTGTTA 


(SEQ 


ID 


NO: 


487) 


GCTTGTTACA 


(SEQ 


ID 


NO: 


489) 


TTGTTACACC 


(SEQ 


ID 


NO: 


491) 


GTTACACCCT 


(SEQ 


ID 


NO: 


493) 


TACACCCTGT 


(SEQ 


ID 


NO: 


495) 


CACCCTGTGA 


(SEQ 


ID 


NO: 


497) 


CCCTGTGAGC 


(SEQ 


ID 


NO: 


499) 


CTGTGAGCCT 


(SEQ 


ID 


NO: 


501) 


GTGAGCCTGC 


(SEQ 


ID 


NO: 


503) 


GAGCCTGCAT 


(SEQ 


ID 


NO: 


505) 


GCCTGCATGG 


(SEQ 


ID 


NO: 


507) 


CTGCATGGAA 


(SEQ. 


ID 


NO: 


509) 


GCATGGAATG 


(SEQ 


ID 


NO: 


511) 


ATGGAATGGA 


(SEQ 


ID 


NO: 


513) 


GGAATGGATG 


(SEQ 


ID 


NO: 


515) 


AATGGATGAC 


(SEQ 


ID 


NO: 


517) 
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ATGGATGACC (SEQ ID NO: 518) 
GGATGACCCT (SEQ ID NO: 520) 
ATGACCCTGA (SEQ ID NO: 522) 
GACCCTGAGA (SEQ ID NO: 524) 
CCCTGAGAGA (SEQ ID NO: 526) 
CTGAGAGAGA (SEQ ID NO: 528) 
GAGAGAGAAG (SEQ ID NO: 530) 
GAGAGAAGTG (SEQ ID NO: 532) 
GAGAAGTGTT (SEQ ID NO: 534) 
GAAGTGTTAG (SEQ ID NO: 53 6) 
AGTGTTAGAG (SEQ ID NO: 53 8) 
TGTTAGAGTG (SEQ ID NO: 540) 
TTAGAGTGGA (SEQ ID NO: 542) 
AGAGTGGAGG (SEQ ID NO: 544) 
AGTGGAGGTT (SEQ ID NO: 546) 
TGGAGGTTTG (SEQ ID NO: 54 8) 
GAGGTTTGAC (SEQ ID NO: 550) 
GGTTTGACAG ( SEQ ID NO : 5 5 2 ) 
TTTGACAGCC (SEQ ID NO: 554) 
TGACAGCCGC (SEQ ID NO:556) \ 
ACAGCCGCCT (SEQ ID NO: 558) ; 
AGCCGCCTAG (SEQ ID NO: 560) j 
CCGCCTAGCA (SEQ ID NO: 562) ; 
GCCTAGCATT (SEQ ID NO: 564) ; 
CTAGCATTTC (SEQ ID NO:566) ; 
AGCATTTCAT (SEQ ID NO:568) ; 
CATTTCATCA (SEQ ID NO:570); 
TTTCATCACG (SEQ ID NO: 572) ; 
TCATCACGTG (SEQ ID NO: 574) ; 
ATCACGTGGC (SEQ ID NO: 576) ; 
CACGTGGCCC (SEQ ID NO: 578) ; 
CGTGGCCCGA (SEQ ID NO: 580) ; 
TGGCCCGAGA (SEQ ID NO: 582); 
GCCCGAGAGC (SEQ ID NO: 5 84); 
CCGAGAGCTG (SEQ ID, NO: 586); 
GAGAGCTGCA (SEQ ID NO: 588); 
GAGCTGCATC ( SEQ ID NO: 590) ; 
GCTGCATCCG (SEQ ID NO: 5 92); 



195 - 

TGGATGACCC (SEQ ID NO: 519); 
GATGACCCTG (SEQ ID NO: 521); 
TGACCCTGAG (SEQ ID NO: 523); 
ACCCTGAGAG (SEQ ID NO: 525); 
CCTGAGAGAG (SEQ ID NO: 527); 
TGAGAGAGAA (SEQ ID NO:529); 
AGAGAGAAGT (SEQ ID NO:531); 
AGAGAAGTGT (SEQ ID NO: 533); 
AGAAGTGTTA (SEQ ID NO: 535) ; 
AAGTGTTAGA (SEQ ID NO: 537); 
GTGTTAGAGT (SEQ ID NO: 539) ; 
GTTAGAGTGG (SEQ ID NO: 541) ; 
TAGAGTGGAG (SEQ ID NO: 543); 
GAGTGGAGGT (SEQ ID NO: 545) ; 
GTGGAGGTTT (SEQ ID NO: 547) ; 
GGAGGTTTGA (SEQ ID NO: 549) ; 
AGGTTTGACA (SEQ ID NO: 551) ; 
GTTTGACAGC (SEQ ID NO: 553); 
TTGACAGCCG (SEQ ID NO: 555) ; 
GACAGCCGCC (SEQ ID NO: 557) ; 
CAGCCGCCTA (SEQ ID NO: 559) ; 
GCCGCCTAGC (SEQ ID NO: 561) ; 
CGCCTAGCAT (SEQ ID NO: 563) ; 
CCTAGCATTT (SEQ ID NO: 565) ; 
TAGCATTTCA (SEQ ID NO: 567) ; 
GCATTTCATC (SEQ ID NO: 569) ; 
ATTTCATCAC (SEQ ID NO: 571) ; 
TTCATCACGT (SEQ ID NO: 573) ; 
CATCACGTGG (SEQ ID NO:575); 
TCACGTGGCC (SEQ ID NO: 577) ; 
ACGTGGCCCG (SEQ ID NO:579); 
GTGGCCCGAG (SEQ ID NO: 581) ; 
GGCCCGAGAG (SEQ ID NO: 583) ; 
CCCGAGAGCT (SEQ ID NO:585); 
CGAGAGCTGC (SEQ ID NO: 587) ; 
AGAGCTGCAT (SEQ ID NO: 589) ; 
AGCTGCATCC (SEQ ID NO: 591) ; 
CTGCATCCGG (SEQ ID NO: 593) ; 
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TGCATCCGGA 


(SEQ 


ID 


NO: 594) ; 


GCATCCGGAG 


(SEQ 


ID 


NO: 


595) ; 


CATCCGGAGT 


(SEQ 


ID 


NO: 596) ; 


ATCCGGAGTA 


(SEQ 


ID 


NO: 


597) ; 


TCCGGAGTAC 


(SEQ 


ID NO: 598) ; 


CCGGAGTACT 


(SEQ 


ID 


NO: 


599) ; 


CGGAGTACTT 


(SEQ 


ID 


NO: 600) ; 


GGAGTACTTC 


(SEQ 


ID 


NO: 


601) ; 


GAGTACTTCA 


(SEQ 


ID 


NO: 602) ; 


AGTACTTCAA 


(SEQ 


ID 


NO: 


603) ; 


GTACTTCAAG 


(SEQ 


ID 


NO: 604) ; 


TACTTCAAGA 


(SEQ 


ID 


NO: 


605) ; 


ACTTCAAGAA 


(SEQ 


ID 


NO: 606) ; 


CTTCAAGAAC 


(SEQ 


ID 


NO: 


607) ; 


TTCAAGAACT 


(SEQ 


ID 


NO: 608) ; 


TCAAGAACTG 


(SEQ 


ID 


NO: 


609) ; 


CAAGAACTGC 


(SEQ 


ID 


NO: 610) ; 


AAGAACTGCT 


(SEQ 


ID 


NO: 


611) / 


AGAACTGCTG 


(SEQ 


ID NO: 612) ; 


GAACTGCTGA 


(SEQ 


ID 


NO: 


613) . 



14. A strain of HTVM according to claim 8 wherein said deletion encompasses at 
least one of the following decanucleotides from the LTR region of HIV- 1 strain NL4-3: 



GCTTTTTfiPP 

X X X X 1\JWU 


(SEQ 


ID NO: 652) ; 


CTTTTTGCCT 


(SEQ 


ID 


NO: 653) 


TTTTTGCCTG 


(SEQ 


ID NO: 654) ; 


111 lbULlul 




ID 


NO: 655) 


TTTGCCTGTA 


(SEQ 


ID NO: 656) ; 






ID 


NO: 657) 


TGCCTGTACT 


(SEQ ID NO: 658) ; 






ID 


NO:659) 


CCTGTACTGG 


(SEQ 


ID NO: 660) ; 


w X w X AVh* X www 




ID 


NO: 661) 


TGTACTGGGT 


(SEQ ID NO: 662) ; 


GTACTGGGTC 


(SEQ 


ID 


NO: 663) 


TACTGGGTCT 


(SEQ 


ID NO: 664) ; 


ACTGGGTCTC 


(SEQ 


ID 


NO: 665) 


CTGGGTCTCT 


(SEQ 


ID NO: 666) ; 


TGGGTCTCTC 


(SEQ 


ID 


NO:667) 


GGGTCTCTCT 


(SEQ 


ID NO: 668) ; 


GGTCTCTCTG 


(SEQ 


ID 


NO: 669) 


GTCTCTCTGG 


(SEQ 


ID NO: 670) ; 


TCTCTCTGGT 


(SEQ 


ID 


NO: 671) 


CTCTCTGGTT 


(SEQ 


ID NO: 672) ; 


TCTCTGGTTA 


(SEQ 


ID 


NO: 673) 


CTCTGGTTAG 


(SEQ 


ID NO: 674) ; 


TCTCTGGTTA 


(SEQ 


ID 


NO: 675) 


CTGGTTAGAC 


(SEQ 


ID NO: 676) ; 


TGGTTAGACC 


(SEQ 


ID 


NO: 677) 


GGTTAGACCA 


(SEQ 


ID NO: 678) ; 


GTTAGACCAG 


(SEQ 


ID 


NO: 679) 


TTAGACCAGA 


(SEQ 


ID NO: 680) ; 


TAGACCAGAT 


(SEQ 


ID.NO:681) 


AGACCAGATC 


(SEQ 


ID NO: 682) ; 


GACCAGATCT 


(SEQ 


ID NO: 683) 


ACCAGATCTG 


(SEQ 


ID NO: 684) ; 


CCAGATCTGA 


(SEQ 


ID 


NO: 685) 


CAGATCTGAG 


(SEQ 


ID NO: 686) ; 


AGATCTGAGC 


(SEQ 


ID 


NO:687) 


GATCTGAGCC 


(SEQ 


ID NO: 688) ; 


ATCTGAGCCT 


(SEQ 


ID 


NO:689) 


TCTGAGCCTG 


(SEQ 


ID NO: 690) ; 


CTGAGCCTGG 


(SEQ 


ID 


NO:691) 


TGAGCCTGGG 


(SEQ 


ID NO: 692) ; 


GAGCCTGGGA 


(SEQ 


ID 


NO: 693) 


AGCCTGGGAG 


(SEQ 


ID NO: 694) ; 


GCCTGGGAGC 


(SEQ 


ID 


NO:695) 


CCTGGGAGCT 


(SEQ 


ID NO.-696) ; 


CTGGGAGCTC 


(SEQ 


ID 


NO:697) , 


TGGGAGCTCT 


(SEQ 


ID NO: 698) ; 


GGGAGCTCTC 


(SEQ 


ID 


NO:699) , 
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GGAGCTCTCT 


(SEQ 


ID, NO: 700) 


; GAGCTCTCTG 


(SEQ 


ID 


NO:701) 


AGCTCTCTGG 


(SEQ 


ID NO: 702) 


; GCTCTCTGGC 


(SEQ 


ID 


NO: 703) 


CTCTCTGGCT 


(SEQ 


ID NO:704) 


; TCTCTGGCTA 


(SEQ 


ID 


NO:705) 


CTCTGGCTAA 


(SEQ 


ID NO: 706) 


; TCTGGCTAAC 


(SEQ 


ID 


NO:707) 


CTGGCTAACT 


(SEQ 


ID NO: 708) 


; TGGCTAACTA 


(SEQ 


ID 


NO:709) 


GGCTAACTAG 


(SEQ 


ID NO:710) 


; GCTAACTAGG 


(SEQ 


ID 


NO: 711) 


CTAACTAGGG 


(SEQ 


ID NO:712) 


; TAACTAGGGA 


(SEQ 


ID 


NO: 713) 


AACTAGGGAA 


(SEQ 


ID NO:714) 


; ACTAGGGAAC 


(SEQ 


ID 


NO: 715) 


CTAGGGAACC 


(SEQ 


ID NO: 716) 


; TAGGGAACCC 


(SEQ ID NO: 717) 


AGGGAACCCA 


(SEQ 


ID NO:718) 


; GGGAACCCAC 


(SEQ 


ID 


NO: 719) 


GGAACCCACT 


(SEQ 


ID NO: 720) 


; GAACCCACTG 


(SEQ 


ID 


NO: 721) 


AACCCACTGC 


(SEQ 


ID NO:722) 


; ACCCACTGCT 


(SEQ 


ID 


NO: 723) 


CCCACTGCTT 


(SEQ 


ID NO: 724) 




( QFO 


ID NO: 725) 


CACTGCTTAA 


(SEQ 


ID NO: 726) 

■»» AaT *^ %a» • / aW W / 


ACTGOTTAACJ 


/ ABA 

(SEQ 


ID 


NO: 727) 


. CTGCTTAAGC 


(SEQ 


ID NO: 728) 


r TGCTTAAGPr 


(oEQ 


ID 


NO:729) 


GCTTAAGCCT 


(SEQ 


ID NO:730) , 


r CTTAAGCCTC 




ID 


NO: 731) 


TTAAGCCTCA 


(SEQ 


ID NO:732) , 


TAAGCCTCAA 




ID 


NO: 733) 


AAGCCTCAAT 


(SEQ 


ID NO:734) , 


* AGCCTCAATA 


(SEQ 


ID 


NO: 735) 


GCCTCAATAA 


(SEQ 


ID NO:736) ; 


CCTCAATAAA 


(SEQ 


ID 


NO: 737) 


CTCAATAAAG 


(SEQ 


ID NO:738) ; 


TCAATAAAGC 


(SEQ 


ID 


NO:739) 


CAATAAAGCT 


(SEQ 


ID NO:740) ; 


AATAAAGCTT 

**** aV rumwVai x x 


(SEQ 


ID 


NO: 741) 


ATAAAGCTTG 


(SEQ 


ID NO: 742) • 




(SEQ 


ID 


NO: 743) 


AAAGCTTGCC 


(SEO 


ID NO • 744 \ < 


AfiVjL. 1 1 V?wL. A 


(SEQ 


ID 


NO: 745) 


AGCTTGCCTT 


( ^FO 






(SEQ 


ID 


NO: 747) 


CTTGCCTTGA 


f SEO 


Tn wo • ha ft ^ • 




(SEQ 


ID 


NO:749) 


TGCCTTGAGT 


(SEO 


ID NO -750) • 


VJUU X X VUi\J X Vj 


(SEQ 


ID 


NO: 751) 


CCTTGAGTGC 


(SEQ 


ID NO: 752) ; 


CTTGAGTGCT 


(SEQ 


ID 


NO: 753) 


TTGAGTGCTT 


(SEQ 


ID NO: 754) ; 


TGAGTGCTTC 


(SEQ 


ID . 


NO: 755) 


GAGTGCTTCA 


(SEQ 


ID NO:756) ; 


AGTGCTTCAA 


(SEQ 


ID 


NO: 757) 


GTGCTTCAAG 


(SEQ 


ID NO:758) ; 


TGCTTCAAGT 


(SEQ 


ID 


NO:759) 


GCTTCAAGTA 


(SEQ 


ID NO:760) ; 


CTTCAAGTAG 


(SEQ 


ID 


NO:761) , 


TTCAAGTAGT 


(SEQ 


ID NO: 762) ; 


TCAAGTAGTG 


(SEQ 


ID 


NO:763) , 


CAAGTAGTGT 


(SEQ 


ID NO: 764) ; 


AAGTAGTGTG 


(SEQ ID NO: 765) , 


AGTAGTGTGT 


(SEQ 


ID NO:766) ; 


GTAGTGTGTG 


(SEQ 


ID 


NO: 767) , 


TAGTGTGTGC 


(SEQ 


ID NO: 768) ; 


AGTGTGTGCC 


(SEQ 


ID 


NO:769) , 


GTGTGTGCCC 


(SEQ 


ID NO: 770) ; 


TGTGTGCCCG 


(SEQ 


ID 


NO: 771) , 


GTGTGCCCGT 


(SEQ 


ID NO: 772) ; 


TGTGCCCGTC 


(SEQ 


ID 


NO: 773) , 


GTGCCCGTCT 


(SEQ 


ID NO: 774) ; 


TGCCCGTCTG 


(SEQ 


ID 


NO:775) , 
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GCCCGTCTGT 


(SEQ 


ID NO: 776) 


CCCGTCTGTT 


(SEQ 


ID 


NO: 


777) 


CCGTCTGTTG 


(SEQ 


ID NO: 778) 


; CGTCTGTTGT 


(SEQ 


ID 


NO: 


779) 


GTCTGTTGTG 


(SEQ 


ID NO: 780) 


; TCTGTTGTGT 


(SEQ 


ID 


NO: 


781) 


CTGTTGTGTG 


(SEQ 


ID NO: 782) , 


. TGTTGTGTGA 


(SEQ 


ID 


NO: 


783) 


GTTGTGTGAC 


(SEQ 


ID NO: 784) , 


? TTGTGTGACT 


(SEQ 


ID 


NO: 


785) 


TGTGTGACTC 


(SEQ 


ID NO: 786) , 


GTGTGACTCT 


(SEQ 


ID 


NO: 


787) , 


TGTGTGACTC 


(SEQ 


ID NO: 788) ; 


GTGTGACTCT 


(SEQ 


ID 


NO: 


789) j 


TGTGACTCTG 


(SEQ 


ID NO: 790) j 


GTGACTCTGG 


(SEQ 


ID 


NO: 


791) ; 


TGACTCTGGT 


(SEQ 


ID NO: 792) ; 


GACTCTGGTA 


(SEQ 


ID 


NO: 


793) ; 


ACTCTGGTAA 


(SEQ 


ID NO: 794) ; 


CTCTGGTAAC 


(SEQ 


ID 


NO: 


795) ; 


TCTGGTAACT 


(SEQ 


ID NO: 796) ; 


CTGGTAACTA 


(SEQ 


ID 


NO: 


797) ; 


TGGTAACTAG 


(SEQ 


ID NO: 798) ; 


GGTAACTAGA 


(SEQ 


ID 


NO: 


799) . 



15. A strain of HTV-1 according to claim 8 having the identifying characteristics of 
HIV-1 isolate ECACC V94101706 or HTV-1 isolate ECACC V94103 1169 or isolate C54 
deposited at ECACC on 14 February, 1995 under Accession Number . 

16. An isolated strain of HTV-1 which is reactive to antibodies to a glycoprotein of 
HIV-1, is capable of inducing an immune response to at least one of gag,pol and/or env 
and which is incapable of directing synthesis of a nef gene product or a full length nef 
gene product 

17. An isolated strain of HTV-1 according to claim 16 wherein the glycoprotein is 
at least one of gp41-45, gpl20 and/or gpl60. 

18. A nucleic acid molecule or a part, fragment or derivative thereof from an HTV-1 
isolate defined in any one of claims 1 to 16. 

19. A molecular infectious clone comprising a nucleic acid molecule according to 
claim 18. 

20. A method for inhibiting or reducing productive infection of an individual by a 
pathogenic strain of HTV-1, said method comprising administering to a subject a non- 
pathogenic isolate of HIV-1 in an amount effective to infect target cells and to generate 



WO 95/21912 PCT/AU95/00063 

- 199 - 

target cells carrying DNA derived from said non-pathogenic HTV-1. 



21. A method according to claim 20 wherein said isolate is capable of stimulating 
an immune response in a human or primate against at least one glycoprotein on HIV-1 
whilst not substantially reducing proliferative responses and cytokine production to a 
mitogen in said human or primate compared to a healthy, non-infected human or primate 
subject. 

22. A method according to claim 21 wherein said isolate carries one or more 
mutations in its genome resulting in the inability to direct synthesis of at least one 
polypeptide or protein from a pathogenic strain of HIV-1. 

23. A method according to claim 17 wherein said isolate carries one or more 
mutations in its genome resulting in said genome directing synthesis of a truncated form 
of a polypeptide or protein from a pathogenic strain of HTV-1 . 

24. A method according to claim 22 or 23 wherein said HIV-1 isolate carries a 
mutation in its we/ gene and/or long terminal repeat (LTR) region or in a functionally / 
equivalent location in the HIV-1 genome. 

25. A method according to claim 24 wherein said HTV-1 isolate is reactive to 
antibodies to gp41-45, gpl20 and/or gpl60 of HIV-1 and carries a deletion of at least 
ten contiguous nucleotides in a region corresponding to the nef gene and/or LTR region 
of pathogenic HTV-1. 

26. A method according to claim 25 wherein said HIV-1 isolate is capable of 
inducing an immune response to at least one of gag, pol and/or env. 

27. A method according to claim 26 wherein said HIV-1 isolate carries a deletion in 
its genome of at least 10 nucleotides within the region from nucleotide 8787 to 
nucleotide 9709 using the nucleotide numbering of HIV-1 strain NL4-3. 
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28. A method according to claim 27 wherein said isolate carries a deletion in its 
genome of at least 10 nucleotides within a region selected from the list consisting of: 
nucleotide (i) 8830-8862; 

(ii) 9009-9035; 

(iii) 9019-9029; and 

(iv) 9033-9049. 

29. A method according to claim 27 wherein said isolate carries a deletion in its 
genome of at least 10 nucleotides within a region selected from the list consisting of: 
nucleotide (v) 928 1 -93 7 1 ; 

(vi) 9281-9362; 

(vii) 9105-9224; and 

(viii) 9271-9370. 

30. A method according to claim 27 wherein said isolate carries a deletion in its 
genome of at least 10 nucleotides within a region selected from the list consisting of: 
nucleotide (ix) 8882-8928; 

(x) 8850-9006; 

(xi) 8792-9041; and 

(xii) 9112-9204. 

31. A method according to claim 27 wherein said isolate carries a deletion in its 
genome of at least 10 nucleotides within a region selected from the list consisting of: 
nucleotide (xiii) 9105-9224; 

(xiv) 9389-9395; and 

(xv) 9281-9366. 

32. A method according to claim 27 wherein said deletion encompasses at least one 
of the following decanucleotides from the nef gene of HIV-1 strain NL43: 



ATGGGTGGCA (SEQ ID NO: 2) 

GGGTGGCAAG (SEQ ID NO: 4) 

GTGGCAAGTG (SEQ ID NO: 6) 

GGCAAGTGGT (SEQ ID NO: 8) 



TGGGTGGCAA (SEQ ID NO: 3) ; 

GGTGGCAAGT (SEQ ID NO:5); 

TGGCAAGTGG (SEQ ID NO; 7); 

GCAAGTGGTC (SEQ ID NO: 9); 
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CAAGTGGTCA 


(SEQ 


ID 


NO 


:10) 


AGTGGTCAAA 


' (SEQ 


ID 


NO 


:12) 


TGGTCAAAAA 


(SEQ 


ID 


NO 


:14) 


GTCAAAAAGT 


(SEQ 


ID 


NO 


:16) 


CAAAAAGTAG 


(SEQ 


ID 


NO 


:18) 


AAAAGTAGTG 


(SEQ 


ID 


NO 


:20) 


AAGTAGTGTG 


(SEQ 


ID 


NO 


:22) 


GTAGTGTGAT 


(SEQ 


ID 


NO 


:24) 


AGTGTGATTG 


(SEQ 


ID 


NO 


:26) 


TGTGATTGGA 


(SEQ 


ID 


NO 


:28) 


TGATTGGATG 


(SEQ 


ID 


NO 


:30) 


ATTGGATGGC 


(SEQ 


ID 


NO 


:32) 


TGGATGGCCT 


(SEQ 


ID 


NO 


:34) 


GATGGCCTGC 


(SEQ 


ID 


NOj 


.36) 


TGGCCTGCTG 


(SEQ 


ID 


NO: 


:38) 


GCCTGCTGTA 


(SEQ 


ID 


NO: 


40) 


CTGCTGTAAG 


(SEQ 


ID 


NO: 


42} 


GCTGTAAGGG 


(SEQ 


ID 


NO: 


44) 


TGTAAGGGAA 


(SEQ 


ID 


NO: 


46) 


TAAGGGAAAG 


(SEQ 


ID 


NO: 


48} 


AGGGAAAGAA 


(SEQ 


ID 


NO: 


50) 


GGAAAGAATG 


(SEQ 


ID 


NO: 


52) 


AAAGAATGAG 


(SEQ 


ID 


NO: 


54) 


AGAATGAGAC 


(SEQ 


ID 


NO: 


56) 


AATGAGACGA 


(SEQ 


ID 


NO: 


58) 


TGAGACGAGC 


(SEQ 


ID 


NO: 


60) 


AGACGAGCTG 


(SEQ 


ID 


NO: 


62) 


ACGAGCTGAG 


(SEQ 


ID 


NO: 


64) 


GAGCTGAGCC 


(SEQ 


ID 


NO: 


66) 


GCTGAGCCAG 


(SEQ 


ID 


NO: 


68) 


TGAGCCAGCA 


(SEQ 


ID 


NO: 


70) 


AGCCAGCAGC 


(SEQ 


ID 


NO: 


72) 


CCAGCAGCAG 


(SEQ 


ID 


NO: 


74) 


AGCAGCAGAT 


(SEQ 


ID 


NO: 


76) 


CAGCAGATGG 


(SEQ 


ID 


NO: 


78) 


GCAGATGGGG 


(SEQ 


ID 


NO: 


80) 


AGATGGGGTG 


(SEQ 


ID 


NO: 


82) 


ATGGGGTGGG 


(SEQ 


ID 


NO: 


84) 
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AAGTGGTCAA 


(SEQ 


ID 


NO 


:11 


GTGGTCAAAA 


(SEQ 


ID 


NO 


:13 


GGTCAAAAAG 


(SEQ 


ID 


NO 


:15, 


TCAAAAAGTA 


(SEQ 


ID 


NO 


:17] 


AAAAAGTAGT 


(SEQ ID NO 


:19) ; 


AAAGTAGTGT 


(SEQ 


ID 


NO 


:21] 


AGTAGTGTGA 


(SEQ 


ID 


NO 


:23] 


TAGTGTGATT 


(SEQ ID NO 


:25) 


; 


GTGTGATTGG 


(SEQ 


ID 


NO 


:27) 


GTGATTGGAT 


(SEQ 


ID 


NO 


:29) 


GATTGGATGG 


(SEQ 


ID 


NO: 


31) 


TTGGATGGCC 


(SEQ 


ID 


NO: 


33) 


GGATGGCCTG 


(SEQ 


ID 


NO: 


35) 


ATGGCCTGCT 


(SEQ 


ID 


NO: 


37) 


GGCCTGCTGT 


(SEQ 


ID 


NO: 


39) 


CCTGCTGTAA 


(SEQ 


ID 


NO: 


41) 


TGCTGTAAGG 


(SEQ 


ID 


NO: 


43) 


CTGTAAGGGA 


(SEQ 


ID 


NO: 


45) 


GTAAGGGAAA 


(SEQ 


ID 


NO: 


47) 


AAGGGAAAGA 


(SEQ 


ID 


NO: 


49) 


GGGAAAGAAT 


(SEQ 


ID 


NO: 


51) 


GAAAGAATGA 


(SEQ 


ID 


NO: 


53) 


AAGAATGAGA 


(SEQ 


ID 


NO: 


55) 


GAATGAGACG 


(SEQ 


ID 


NO: 


57) 


ATGAGACGAG 


(SEQ 


ID 


NO: 


59) 


GAGACGAGCT 


(SEQ 


ID 


NO: 


61) 


GACGAGCTGA 


(SEQ 


ID 


NO: 


63) 


CGAGCTGAGC 


(SEQ 


ID 


NO: 


65) 


AGCTGAGCCA 


(SEQ 


ID 


NO: 


67) 


CTGAGCCAGC 


(SEQ 


ID 


NO: 


69) 


GAGCCAGCAG 


(SEQ 


ID 


NO: 


71) 


GCCAGCAGCA 


(SEQ 


ID 


NO: 


73) 


CAGCAGCAGA 


(SEQ 


ID 


NO: 


75) 


GCAGCAGATG 


(SEQ 


ID 


NO: 


77) 


AGCAGATGGG 


(SEQ 


ID 


NO: 


79) 


CAGATGGGGT 


(SEQ 


ID 


NO: 


81) 


GATGGGGTGG 


(SEQ 


ID 


NO: 


83) 


TGGGGTGGGA 


(SEQ 


ID 


NO: 


85) 
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GGGGTGGGAG (SEQ ID NO:86); 
GGTGGGAGCA (SEQ ID NO:88); 
TGGGAGCAGT (SEQ ID NO: 90) ; 
GGAGCAGTAT (SEQ ID NO:92); 
AGCAGTATCT (SEQ ID NO: 94); 
CAGTATCTCG (SEQ ID NO: 96); 
GTATCTCGAG (SEQ ID NO: 98); 
ATCTCGAGAC (SEQ ID NO: 100) ; 
CTCGAGACCT (SEQ ID NO: 102); 
CGAGACCTAG (SEQ ID NO: 104); 
AGACCTAGAA (SEQ ID NO: 106); 
ACCTAGAAAA (SEQ ID NO: 108); 
CTAGAAAAAC (SEQ ID NO: 110); 
AGAAAAACAT (SEQ ID NO: 112); 
AAAAACATGG (SEQ ID NO: 114); 
AAACATGGAG (SEQ ID NO:116) ; 
ACATGGAGCA (SEQ ID NO: 118); 
ATGGAGCAAT (SEQ ID NO: 120) ; 
GGAGCAATCA (SEQ ID NO:122) ; 
AGCAATCACA (SEQ ID NO:124); 
CAATCACAAG (SEQ ID NO:126); 
ATCACAAGTA (SEQ ID NO: 128); 
CACAAGTAGC (SEQ ID NO: 130); 
CAAGTAGCAA (SEQ ID NO: 132) ; 
AGTAGCAATA (SEQ ID NO: 134) ; 
TAGCAATACA (SEQ ID NO: 136) ; 
GCAATACAGC (SEQ ID NO: 13 8) ; 
AATACAGCAG (SEQ ID NO: 140) ; 
TACAGCAGCT (SEQ ID NO: 142); 
CAGCAGCTAA (SEQ ID NO: 144) ; 
GCAGCTAACA (SEQ ID NO: 146); 
AGCTAACAAT (SEQ ID NO: 14 8) ; 
CTAACAATGC (SEQ ID NO:150); 
AACAATGCTG (SEQ ID NO:152); 
CAATGCTGCT (SEQ ID NO:154); 
ATGCTGCTTG (SEQ ID NO:156); 
GCTGCTTGTG (SEQ ID NO:158); 
TGCTTGTGCC (SEQ ID NO:160); 
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GGGTGGGAGC 


(SEQ 


ID 


NO: 


87) 


GTGGGAGCAG 


(SEQ 


ID 


NO: 


89) 


GGGAGCAGTA 


(SEQ 


ID 


NO: 


91) 


GAGCAGTATC 


(SEQ 


ID 


NO: 


93) 


GCAGTATCTC 


(SEQ 


ID 


NO: 


95) 


AGTATCTCGA 


(SEQ 


ID 


NO: 


97) 


TATCTCGAGA 


(SEQ 


ID 


NO: 


99) 


TCTCGAGACC 


(SEQ 


ID 


NO:l 


01) 



TCGAGACCTA (SEQ ID NO: 103) 
GAGACCTAGA (SEQ ID NO: 105) 
GACCTAGAAA (SEQ ID NO: 107) 
CCTAGAAAAA (SEQ ID NO: 109) 
TAGAAAAACA (SEQ ID NO: 111) 
GAAAAACATG (SEQ ID NO: 113) 
AAAACATGGA (SEQ ID NO: 115) 
AACATGGAGC (SEQ ID NO:117) 
CATGGAGCAA (SEQ. ID NO: 119) 
TGGAGCAATC (SEQ ID NO: 121) 
GAGCAATCAC (SEQ ID NO: 123) 
GCAATCACAA (SEQ ID NO:125) 
AATCACAAGT (SEQ ID NO: 127) 
TCACAAGTAG (SEQ ID NO: 129) 
ACAAGTAGCA (SEQ ID NO: 131) 
AAGTAGCAAT (SEQ ID NO: 133) 
GTAGCAATAC (SEQ ID NO: 135) 
AGCAATACAG (SEQ ID NO: 137) 
CAATACAGCA (SEQ ID NO: 139) 
ATACAGCAGC (SEQ ID NO: 141) 
ACAGCAGCTA (SEQ ID NO: 143) 
AGCAGCTAAC (SEQ ID NO: 145) 
CAGCTAACAA (SEQ ID NO: 147) 
GCTAACAATG (SEQ ID NO: 14 9) 
TAACAATGCT (SEQ ID NO: 151) 
ACAATGCTGC (SEQ ID NO: 153) 
AATGCTGCTT (SEQ ID NO: 155) 
TGCTGCTTGT (SEQ ID NO: 157) 
CTGCTTGTGC (SEQ ID NO: 159) . 
GCTTGTGCCT (SEQ ID NO: 161) { 
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CTTGTGCCTG (SEQ ID NO: 162) 
TGTGCCTGGC (SEQ ID NO: 164) 
TGCCTGGCTA (SEQ ID NO: 166) 
CCTGGCTAGA (SEQ ID NO: 168) 
TGGCTAGAAG (SEQ ID NO: 170) 
GCTAGAAGCA (SEQ ID NO: 172) 
TAGAAGCACA (SEQ ID NO: 174) 
GAAGCACAAG (SEQ ID NO: 176) 
AGCACAAGAG (SEQ ID NO: 178) 
CACAAGAGGA (SEQ ID NO: 180) 
CAAGAGGAGG (SEQ ID NO: 182) 
AGAGGAGGAA (SEQ ID NO: 184) 
AGGAGGAAGA (SEQ ID NO: 186) 
GAGGAAGAGG (SEQ ID NO: 188) 
GGAAGAGGTG (SEQ ID NO: 190) 
AAGAGGTGGG (SEQ ID NO: 192) 
GAGGTGGGTT (SEQ ID NO: 194) 
GGTGGGTTTT (SEQ ID NO: 196) 
TGGGTTTTCC (SEQ ID NO: 198) 
GGTTTTCCAG (SEQ ID NO: 200) 
TTTTCCAGTC (SEQ ID NO: 202) 
TTCCAGTCAC (SEQ ID NO: 2 04) 
CCAGTCACAC (SEQ ID NO: 206) 
AGTCACACCT (SEQ ID NO:208) 
TCACACCTCA (SEQ ID NO: 210) 
ACACCTCAGG (SEQ ID NO: 2 12) . 
ACCTCAGGTA (SEQ ID NO: 214), 
CTCAGGTACC (SEQ ID NO:216) ; 
CAGGTACCTT (SEQ ID NO: 218) ; 
GGTACCTTTA (SEQ ID NO: 220) ; 
TACCTTTAAG (SEQ ID NO: 2 22); 
CCTTTAAGAC (SEQ ID NO: 224) ; 
TTTAAGACCA (SEQ ID NO: 226); 
TAAGACCAAT (SEQ ID NO:228) ; 
AGACCAATGA (SEQ ID NO:230); 
ACCAATGACT (SEQ ID NO: 232); 
CAATGACTTA (SEQ ID NO: 234); 
ATGACTTACA .(SEQ ID NO : 23 6 ) ; 
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TTGTGCCTGG 


(SEQ 


ID 


NO 


:163) 


GTGCCTGGCT 


(SEQ 


ID 


NO 


:165) 


GCCTGGCTAG 


(SEQ 


ID 


NO 


:167J 


CTGGCTAGAA 


(SEQ 


ID 


NO 


:169) 


GGCTAGAAGC 


(SEQ 


ID 


NO 


:171) 


CTAGAAGCAC 


(SEQ 


ID 


NO 


:173) 


AGAAGCACAA 


(SEQ 


ID 


NO 


:175) 


AAGCACAAGA 


(SEQ 


ID 


NO 


:177) 


GCACAAGAGG 


(SEQ 


ID 


NO 


:179) 


ACAAGAGGAG 


(SEQ 


ID 


NO 


:181) 


AAGAGGAGGA 


(SEQ 


ID 


NO 


:183) 


GAGGAGGAAG 


(SEQ 


ID 


NO 


185) 


GGAGGAAGAG 


(SEQ 


ID 


NO: 


187) 


AGGAAGAGGT 


(SEO 


ID 


NO: 


189) 


GAAGAGGTGG 


(SEQ 


ID 


NO: 


191) 


AGAGGTGGGT 


(SEQ 


ID 


NO: 


193) 


AGGTGGGTTT 


(SEQ 


ID 


NO: 


195) 


GTGGGTTTTC 


(SEQ 


ID 


NO: 


197) 


GGGTTTTCCA 


(SEQ 


ID 


NO: 


199) 


GTTTTCCAGT 


(SEO 


ID 


NO: 


201) 


TTTCCAGTCA 


(SEO 


ID 


NO- 


ab V J § 




f SEO 


±1/ 










TFJ 
iU 


vt\J : 


/ ) 






JLJJ 


Vi\J : 




PaP2VPPTP2if3 


( qt?o 


XL) 


vrr\ , 

vi\J : 


zxx ; 


CAPPTPAGGT 


f SEO 
\ uuy 


TD 

XLs 




^Xj / 


CCTCAGGTAC 


(SEQ 


ID 


NO: 


215) j 


TCAGGTACCT 


(SEQ 


ID 


NO: 


217) , 


AGGTACCTTT 


(SEQ 


ID 


NO: 


219) , 


GTACCTTTAA 


(SEQ 


ID 


NO: 


221) . 


ACCTTTAAGA 


(SEQ 


ID 


NO: 


223) , 


CTTTAAGACC 


(SEQ 


ID 


NO: 


225) i 


TTAAGACCAA 


(SEQ 


ID 


NO: 


227) , 


AAGACCAATG 


(SEQ 


ID 


NO: 


229) , 


GACCAATGAC 


(SEQ 


ID 


NO: 


231) , 


CCAATGACTT 


(SEQ 


ID 


NO: 


233) t 


AATGACTTAC 


(SEQ 


ID 


NO: 


235) , 


TGACTTACAA 


(SEQ 


ID 


NO: 


237) t 
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GACTTACAAG (SEQ ID NO : 2 3 8 ) ; 

CTTACAAGGC (SEQ ID NO: 240) ; 

TACAAGGCAG (SEQ ID NO:242); 

CAAGGCAGCT (SEQ ID NO: 244) ; 

AGGCAGCTGT ( SEQ ID NO : 2 4 6 ) ; 

GCAGCTGTAG ( SEQ ID NO : 2 4 8 ) ; 

AGCTGTAGAT ( SEQ ID NO : 2 5 0 ) ; 

CTGTAGATCT (SEQ ID NO: 252) ; 

GTAGATCTTA (SEQ ID NO: 254) ; 

AGATCTTAGC (SEQ ID NO: 2 56); 

ATCTTAGCCA (SEQ ID NO: 258) ; 

CTTAGCCACT (SEQ ID NO: 260) ; 

TAGCCACTTT (SEQ ID NO: 262) ; 

GCCACTTTTT (SEQ ID NO : 2 64 ) ; 

CACTTTTTAA (SEQ ID NO: 266) ; 

CTTTTTAAAA (SEQ ID NO: 268); 

TTTTAAAAGA (SEQ ID NO: 270) ; 

TTAAAAGAAA (SEQ ID NO:272) ; 
AAAAGAAAAG (SEQ ID NO: 274) ; 
AAGAAAAGGG (SEQ ID NO: 276); 
GAAAAGGGGG (SEQ ID NO:278) ; 
AAAGGGGGGA (SEQ ID NO:280) ; 
AGGGGGGACT (SEQ ID NO: 282) ; 
GGGGGACTGG (SEQ ID NO: 2 84); 
GGGACTGGAA (SEQ ID NO: 286) ; 
GACTGGAAGG (SEQ ID NO: 288); 
CTGGAAGGGC (SEQ ID NO: 290); 
GGAAGGGCTA (SEQ ID NO:292); 
AAGGGCTAAT (SEQ ID NO: 2 94) ; 
GGGCTAATTC ( SEQ ID NO : 2 9 6 ) ; 
GCTAATTCAC (SEQ ID NO: 2 98) ; 
TAATTCACTC (SEQ ID NO: 300); 
ATTCACTCCC (SEQ ID NO: 302); 
TCACTCCCAA (SEQ ID NO: 3 04) ; 
ACTCCCAAAG (SEQ ID NO:306); 
TCCCAAAGAA (SEQ ID NO:308) ; 
CCAAAGAAGA (SEQ ID NO:310); 
AAAGAAGACA (SEQ ID NO: 312) ; 
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ACTTACAAGG (SEQ ID NO: 23 9) , 

TTACAAGGCA (SEQ ID NO:241) i 

ACAAGGCAGC (SEQ ID NO: 243).; 

AAGGCAGCTG (SEQ ID NO:245) j 

GGCAGCTGTA (SEQ ID NO: 247); 

CAGCTGTAGA (SEQ ID NO: 249) ; 

GCTGTAGATC (SEQ ID NO:251) ; 

TGTAGATCTT (SEQ ID NO: 2 53); 

TAGATCTTAG (SEQ ID NO: 255) ; 

GATCTTAGCC (SEQ ID NO: 257) ; 

TCTTAGCCAC (SEQ ID NO: 259) ; 

TTAGCCACTT (SEQ ID NO: 261) ; 

AGCCACTTTT (SEQ ID NO: 263) ; 

CCACTTTTTA (SEQ ID NO: 265) ; 

ACTTTTTAAA (SEQ ID NO: 267) ; 

TTTTTAAAAG (SEQ ID NO: 269) ; 

TTTAAAAGAA (SEQ ID NO: 271) ; 

TAAAAGAAAA (SEQ ID NO: 273) ; 
AAAGAAAAGG (SEQ ID NO: 275) ; 
AGAAAAGGGG (SEQ ID NO: 277) ; 
AAAAGGGGGG (SEQ ID NO:279); 
AAGGGGGGAC (SEQ ID NO: 281) ; 
GGGGGGACTG (SEQ ID NO: 283) ; 
GGGGACTGGA (SEQ ID NO:285) ; 
GGACTGGAAG (SEQ ID NO: 287); 
ACTGGAAGGG (SEQ ID NO:289); 
TGGAAGGGCT (SEQ ID NO:291); 
GAAGGGCTAA (SEQ ID NO:293) ; 
AGGGCTAATT (SEQ ID NO: 295) ; 
GGCTAATTCA (SEQ ID NO:297) ; 
CTAATTCACT (SEQ ID NO: 299) ; 
AATTCACTCC (SEQ ID NO: 301) ; 
TTCACTCCCA (SEQ ID NO:303); 
CACTCCCAAA (SEQ ID NO:305) ; 
CTCCCAAAGA (SEQ ID NO:307) ; 
CCCAAAGAAG (SEQ ID NO: 309) ; 
CAAAGAAGAC (SEQ ID NO: 311) ; 
AAGAAGACAA (SEQ ID NO: 3 13) ; 
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AGAAGACAAG 


(SEQ 


ID 


NO 


:314 


AAGACAAGAT 


(SEQ 


ID 


NO 


:316 


GACAAGATAT 


(SEQ 


ID 


NO 


:318 


CAAGATATCC 


(SEQ 


ID 


NO 


:320 


AGATATCCTT 


(SEQ 


ID 


NO 


:322 


ATATCCTTGA 


(SEQ 


ID 


NO 


:324 


ATCCTTGATC 


(SEQ 


ID 


NO 


:326 


CCTTGATCTG 


(SEQ 


ID 


NO 


:328 


TTGATCTGTG 


(SEQ 


ID 


NO 


:330 


GATCTGTGGA 


(SEQ 


ID 


NO 


:332 


TCTGTGGATC 


(SEQ 


ID 


NO, 


334 


TGTGGATCTA 


(SEQ 


ID 


NO: 


336 


TGGATCTACC 


(SEQ 


ID 


NO: 


338 


GATCTACCAC 


(SEQ 


ID 


NO: 


340] 


TCTACCACAC 


(SEQ 


ID 


NO: 


342! 


TACCACACAC 


(SEQ 


ID 


NO: 


344! 


CCACACACAA 


(SEQ 


ID 


NO: 


346! 


ACACACAAGG 


(SEQ 


ID 


NO: 


348] 


ACACAAGGCT 


(SEQ 


ID 


NO: 


350! 


ACAAGGCTAC 


(SEO 


ID 


NO: 


352] 


nnvjvju x Xaw x X 




XXV 


wn • 

Vi\J l 


OCA 1 


wwV* X X \_»V_ 




J. u 


ViKJ Z 




CTACTTCCCT 

x* inv x <l vowv* x 


f SEO 


in 




O 3D i 


ACTTCCCTGA 


(SEO 


ID 


NO: 


360) 


TTCCCTGATT 


(SEQ 


ID 


NO: 


362) 


CCCTGATTGG 


(SEQ 


ID 


NO: 


364) 


CTGATTGGCA 


(SEQ 


ID 


NO: 


366) 


GATTGGCAGA 


(SEQ 


ID 


NO: 


368) 


TTGGCAGAAC 


(SEQ 


ID 


NO: 


370) 


GGCAGAACTA 


(SEQ 


ID 


NO: 


372) 


CAGAACTACA 


(SEQ 


ID 


NO: 


374) 


GAACTACACA 








*5 / 0 J 


ACTACACACC 


(SEQ 


ID 


NO: 


378) 


TACACACCAG 


(SEQ 


ID 


NO: 


380) 


CACACCAGGG 


(SEQ 


ID 


NO: 


382) 


CACCAGGGCC 


(SEQ 


ID 


NO: 


384) 


CCAGGGCCAG 


(SEQ 


ID 


NO: 


386) 


AGGGCCAGGG 


(SEQ 


ID 


NO: 


388) 
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GAAGACAAGA 


(SEQ 


ID 


NO 


:315) 


AGACAAGATA 


(SEQ 


ID 


NO 


:317) 


ACAAGATATC 


(SEQ 


ID 


NO 


:319) 


AAGATATCCT 


(SEQ 


ID 


NO 


:321) 


GATATCCTTG 


(SEQ 


ID 


NO 


:323) 


TATCCTTGAT 


(SEQ 


ID 


NO 


:325) 


TCCTTGATCT 


(SEQ 


ID 


NO 


:327) 


CTTGATCTGT 


(SEQ 


ID 


NO 


:329) 


TGATCTGTGG 


(SEQ 


ID 


NO 


:331) 


ATCTGTGGAT 


(SEQ 


ID 


NO; 


,333) 


CTGTGGATCT 


(SEQ 


ID 


NO: 


335) 


GTGGATCTAC 


(SEQ 


ID 


NO: 


337) 


GGATCTACCA 


(SEQ 


ID 


NO: 


339) 


ATCTACCACA 


(SEO 


ID 


NO: 


341) 


CTACCACACA 


(SEO 


ID 


NO: 


343) 


ACCACACACA 


(SEO 


ID 


NO: 


345) 


CACACACAAG 


(SEQ 


ID 


NO: 


347) 


CACACAAGGC 


(SEQ 


ID 


NO: 


349) 


CACAAGGCTA 


(SEO 


ID 


NO: 


351) 


CAAGGCTACT 


I SEO 


ID 


NO • 


) 


nuwvlnV«l XO 




XU 


TSJ/-V . 

inu : 


ODD ) 


V?l*X.A\*x XLLL 




TFl 
±U 
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JJ / / 


lAu X X v~V_ X Vj 




xu 


KTO • 
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CTTCCCTGAT 


(SEO 


ID 


NO • 


JOl/ 


TCCCTGATTG 


(SEO 


ID 


NO: 


363) 


CCTGATTGGC 


(SEQ 


ID 


NO: 


365) 


TGATTGGCAG 


(SEQ 


ID 


NO: 


367) 


ATTGGCAGAA 


(SEQ 


ID 


NO: 


369) 


TGGCAGAACT 


(SEQ 


ID 


NO: 


371) 


GCAGAACTAC 


(SEQ 


ID 


NO: 


373) 


AGAACTACAC 


(SEQ 


ID 


NO: 


375) 


AACTACACAP 




m 

±u 


un • 

Vi\J Z 


6 11) 


CTACACACCA 


(SEQ 


ID 


NO: 


379) 


ACACACCAGG 


(SEQ 


ID 


NO: 


381) 


ACACCAGGGC 


(SEQ 


ID 


NO: 


383) 


ACCAGGGCCA 


(SEQ 


ID 


NO: 


385) 


CAGGGCCAGG 


(SEQ 


ID 


NO: 


387) 


GGGCCAGGGG 


(SEQ 


ID 


NO: 


389) 
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GGCCAGGGGT (SEQ ID NO: 390) 
CCAGGGGTCA (SEQ ID NO: 3 92) 
AGGGGTCAGA (SEQ ID NO: 394) 
GGGTCAGATA (SEQ ID NO: 396) 
GTCAGATATC (SEQ ID NO: 398) 
CAGATATCCA (SEQ ID NO: 4 00) 
GATATCCACT (SEQ ID NO: 402) 
TATCCACTGA (SEQ ID NO: 4 04) 
TCCACTGACC (SEQ ID NO: 4 06) 
CACTGACCTT (SEQ ID NO: 408) 
CTGACCTTTG (SEQ ID NO: 410) 
GACCTTTGGA (SEQ ID NO: 4 12) 
CCTTTGGATG (SEQ ID NO: 4 14) 
TTTGGATGGT (SEQ ID NO: 416) 
TGGATGGTGC (SEQ ID NO: 418) 
GATGGTGCTA (SEQ ID NO: 420) 
TGGTGCTACA (SEQ ID NO: 422) , 
GTGCTACAAG (SEQ ID NO: 4 24), 
GCTACAAGCT (SEQ ID NO:426) ; 
TACAAGCTAG (SEQ ID NO:428) ; 
CAAGCTAGTA (SEQ ID NO:430) ; 
AGCTAGTACC (SEQ ID NO: 432) ; 
CTAGTACCAG (SEQ ID NO: 434) ; 

AGTACCAGTT (SEQ ID NO: 436) ; 

TACCAGTTGA (SEQ ID NO:438) ; 

CCAGTTGAGC (SEQ ID NO: 440) ; 

AGTTGAGCCA (SEQ ID NO: 442) ; 

TTGAGCCAGA (SEQ ID NO: 444); 

GAGCCAGATA (SEQ ID NO: 44 6) ; 

GCCAGATAAG (SEQ ID NO:448) ; 

CAGATAAGGT (SEQ ID NO:450) ; 

GATAAGGTAG (SEQ ID NO: 452) ; 

TAAGGTAGAA (SEQ ID NO:454) ; 

AGGTAGAAGA (SEQ ID NO:456) ; 

GTAGAAGAGG (SEQ ID NO: 458) ; 

AGAAGAGGCC (SEQ ID NO:460) ; 

AAGAGGCCAA (SEQ ID NO:462) ; 

GAGGCCAATA (SEQ ID NO: 464) ; 
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GCCAGGGGTC ( SEQ ID NO: 391) ; 
CAGGGGTCAG (SEQ ID NO: 3 93); 
GGGGTCAGAT (SEQ ID NO:395); 
GGTCAGATAT (SEQ ID NO: 3 97) ; 
TCAGATATCC (SEQ ID NO: 3 99); 

AGATATCCAC (SEQ ID NO: 401).; 

ATATCCACTG (SEQ ID NO: 403) ; 

ATCCACTGAC ( SEQ ID NO : 4 0 5 ) ; 

CCACTGACCT (SEQ ID NO: 407) ; 

ACTGACCTTT (SEQ ID NO: 409) ; 

TGACCTTTGG (SEQ ID N0:411) ; 

ACCTTTGGAT (SEQ ID NO: 4 13) ; 

CTTTGGATGG (SEQ ID NO: 415); 

TTGGATGGTG (SEQ ID NO: 4 17) ; 

GGATGGTGCT (SEQ ID NO: 419) ; 

ATGGTGCTAC (SEQ ID NO: 421) ; 

GGTGCTACAA (SEQ ID NO: 423) ; 

TGCTACAAGC (SEQ ID NO: 425) ; 

CTACAAGCTA (SEQ ID NO: 427) ; 

ACAAGCTAGT (SEQ ID NO: 429) ; 

AAGCTAGTAC (SEQ ID NO: 431) ; 

GCTAGTACCA (SEQ ID NO: 433) ; 

TAGTACCAGT ( SEQ ID NO : 4 3 5 ) ; 

GTACCAGTTG (SEQ ID NO: 43 7) ; 
ACCAGTTGAG (SEQ ID NO:439) ; 
CAGTTGAGCC (SEQ ID NO: 441) ; 
GTTGAGCCAG (SEQ ID NO: 443) ; 
TGAGCCAGAT (SEQ ID NO: 445) ; 
AGCCAGATAA (SEQ ID NO:447) ; 
CCAGATAAGG (SEQ ID NO: 449) ; 
AGATAAGGTA (SEQ ID NO: 451) ; 
ATAAGGTAGA (SEQ ID NO: 453) ; 
AAGGTAGAAG (SEQ ID NO:455) ; 
GGTAGAAGAG (SEQ ID NO: 457) ; 
TAGAAGAGGC (SEQ ID NO:459) ; 
GAAGAGGCCA (SEQ ID NO:461) ; 
AGAGGCCAAT (SEQ ID NO: 4 63) ; 
AGGCCAATAA (SEQ ID NO:465) ; 
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GGCCAATAAA (SEQ ID NO:466 
CCAATAAAGG (SEQ ID NO: 468 
AATAAAGGAG (SEQ ID NO: 470 
TAAAGGAGAG (SEQ ID NO: 472 
AAGGAGAGAA (SEQ ID NO: 4 74 
GGAGAGAACA (SEQ ID NO: 476 
AGAGAACACC (SEQ ID NO: 478 
AGAACACCAG (SEQ ID NO: 480 
AACACCAGCT (SEQ ID NO: 482 
CACCAGCTTG (SEQ ID NO: 484 
CCAGCTTGTT (SEQ ID NO: 486! 
AGCTTGTTAC (SEQ ID NO: 4 88] 
CTTGTTACAC (SEQ ID NO: 490] 
TGTTACACCC (SEQ ID NO: 492] 
TTACACCCTG (SEQ ID NO: 4 94] 
ACACCCTGTG (SEQ ID NO: 4 96] 
ACCCTGTGAG (SEQ ID NO: 498) 
CCTGTGAGCC (SEQ ID NO: 500) 
TGTGAGCCTG (SEQ ID NO: 502) 
TGAGCCTGCA (SEQ ID NO: 504) 
AGCCTGCATG (SEQ ID NO: 506) 
CCTGCATGGA (SEQ ID NO: 50 8) 
TGCATGGAAT (SEQ ID NO: 510) 
CATGGAATGG (SEQ ID NO: 512) 
TGGAATGGAT (SEQ ID NO: 514) 
GAATGGATGA (SEQ ID NO: 516) 
ATGGATGACC (SEQ ID NO: 518) 
GGATGACCCT (SEQ ID NO: 520) 
ATGACCCTGA (SEQ ID NO: 522) 
GACCCTGAGA (SEQ ID NO: 524) 
CCCTGAGAGA (SEQ ID NO: 526) 
CTGAGAGAGA (SEQ ID NO: 528) 
GAGAGAGAAG (SEQ ID NO: 530) 
GAGAGAAGTG (SEQ ID NO: 532) 
GAGAAGTGTT (SEQ ID NO: 534) 
GAAGTGTTAG (SEQ ID NO: 536) 
AGTGTTAGAG (SEQ ID NO: 538) 
TGTTAGAGTG (SEQ ID NO: 540) 
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GCCAATAAAG 


(SEQ 


ID 


NO 


:467) 


CAATAAAGGA 


(SEQ 


ID 


NO 


:469) 


ATAAAGGAGA 


(SEQ 


ID 


NO 


:471) 


AAAGGAGAGA 


(SEQ 


ID 


NO 


:473) 


AGGAGAGAAC 


(SEQ 


ID 


NO 


:475) 


GAGAGAACAC 


(SEQ 


ID 


NO 


:477) 


GAGAACACCA 


(SEQ 


ID 


NO 


:479) 


GAACACCAGC 


(SEQ 


ID 


NO 


:481) 


ACACCAGCTT 


(SEQ 


ID 


NO 


:483) 


ACCAGCTTGT 


(SEQ 


ID 


NO 


:485) 


CAGCTTGTTA 


(SEQ 


ID 


NO 


:487) 


GCTTGTTACA 


(SEQ 


ID 


NO 


:489) 


TTGTTACACC 


(SEO 


ID 

XXj/ 


NO- 


491) 




(SEO 


ID 

X XV 


NO- 


"i j j i 


TACACCCTGT 


(SEO 


ID 


NO: 


495) 


CACCCTGTGA 


(SEO 


ID 


NO: 


497) 


CCCTGTGAGC 


(SEQ 


ID 


NO: 


499) 


CTGTGAGCCT 


(SEO 


ID 


NO: 


501) 


GTGAGCCTGC 


(SEO 




NO* 




wn\jLV# x VjV^aX 




±u 


vt\j : 


cnc\ 

OKJD } 


uLL 1 CjUA 1 \j\3 


/ CPA 


±U 


1MU : 


507 ) 




/ PDA 


TT\ 
XU 


riU : 








TT\ 

1.U 


xNvJ : 


CI 1 \ 

oil ; 






J.JJ 


jnu : 




GGAATGGATG 




Xi/ 






AATGGATGAC 


(SEO 


ID 


NO: 


517) 


TGGATGACCC 


(SEQ 


ID 


NO: 


519) 


GATGACCCTG 


(SEQ 


ID 


NO: 


521) 


TGAGCCTGAG 


(SEQ 


ID 


NO: 


523) 


ACCCTGAGAG 


(SEQ 


ID 


NO: 


525) 


CCTGAGAGAG 


(SEQ 


ID 


NO: 


527) 


TGAGAGAGAA 


(SEO 


ID 


NO • 




AGAGAGAAGT 


(SEQ 


ID 


NO: 


531) 


AGAGAAGTGT 


(SEQ 


ID 


NO: 


533) 


AGAAGTGTTA 


(SEQ 


ID 


NO: 


535) 


AAGTGTTAGA 


(SEQ 


ID 


NO: 


537) 


GTGTTAGAGT 


(SEQ 


ID 


NO: 


539) 


GTTAGAGTGG 


(SEQ 


ID 


NO: 


541) 



WO 95/21912 



PCT/AU95/00063 



TTAGAGTGGA (SEQ ID NO: 542) 
AGAGTGGAGG (SEQ ID NO: 544) 
AGTGGAGGTT (SEQ ID NO: 546) 
TGGAGGTTTG (SEQ ID NO: 548) 
GAGGTTTGAC (SEQ ID NO: 550) 
GGTTTGACAG (SEQ ID NO: 552) 
TTTGACAGCC (SEQ ID NO: 554) 
TGACAGCCGC (SEQ ID NO: 556) 
ACAGCCGCCT (SEQ ID NO: 558) 
AGCCGCCTAG (SEQ ID NO: 560) 
CCGCCTAGCA (SEQ ID NO: 562) 
GCCTAGCATT (SEQ ID NO: 564) 
CTAGCATTTC (SEQ ID NO: 566) 
AGCATTTCAT (SEQ ID NO: 568) 
CATTTCATCA (SEQ ID NO: 570) 
TTTCATCACG (SEQ ID NO: 572) 
TCATCACGTG ( SEQ ID NO : 5 74 ) 
ATCACGTGGC (SEQ ID NO: 576) 
CACGTGGCCC (SEQ ID NO: 578) 
CGTGGCCCGA (SEQ ID NO : 58 0 ) , 
TGGCCCGAGA (SEQ ID NO: 582 ) ; 
GCCCGAGAGC (SEQ ID NO: 584), 
CCGAGAGCTG ( SEQ ID NO : 5 8 6 ) ; 
GAGAGCTGCA (SEQ ID NO: 588); 
GAGCTGCATC (SEQ ID NO: 590) ; 
GCTGCATCCG (SEQ ID NO: 592) ; 
TGCATCCGGA (SEQ ID NO: 5 94) ; 
CATCCGGAGT (SEQ ID NO:596) ; 
TCCGGAGTAC (SEQ ID NO: 598) ; 
CGGAGTACTT (SEQ ID NO:600) ; 
GAGTACTTCA (SEQ ID NO: 602) ; 
GTACTTCAAG (SEQ ID NO: 604) ; 
ACTTCAAGAA (SEQ ID NO:606) ; 
TTCAAGAACT (SEQ ID NO: 608) ; 
CAAGAACTGC (SEQ ID NO: 610); 
AGAACTGCTG (SEQ ID NO: 612); 
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TAGAGTGGAG (SEQ ID NO: 543) 
GAGTGGAGGT (SEQ ID NO: 545) 
GTGGAGGTTT (SEQ ID NO: 547) 
GGAGGTTTGA (SEQ ID NO: 54 9) 
AGGTTTGACA (SEQ ID NO: 551) 
GTTTGACAGC (SEQ ID NO: 553) 
TTGACAGCCG (SEQ ID NO: 555) 
GACAGCCGCC (SEQ ID NO: 557) 
CAGCCGCCTA (SEQ ID NO: 559) 
GCCGCCTAGC (SEQ ID NO: 561) 
CGCCTAGCAT (SEQ ID NO: 563) , 
CCTAGCATTT (SEQ ID NO: 565) , 
TAGCATTTCA (SEQ ID NO: 567) , 
GCATTTCATC (SEQ ID NO: 569) ; 
ATTTCATCAC (SEQ ID NO:571) j 
TTCATCACGT (SEQ ID NO: 573) ; 
CATCACGTGG (SEQ ID NO: 575) ; 
TCACGTGGCC (SEQ ID NO: 577) ; 
ACGTGGCCCG (SEQ ID NO: 579) ; 
GTGGCCCGAG (SEQ ID NO: 581) ; 
GGCCCGAGAG (SEQ ID NO: 583) ; 
CCCGAGAGCT (SEQ ID NO: 585); 
CGAGAGCTGC (SEQ ID NO: 5 87) ; 
AGAGCTGCAT (SEQ ID NO: 589); 
AGCTGCATCC (SEQ ID NO: 591) ; 
CTGCATCCGG (SEQ ID NO: 593) ; 
GCATCCGGAG (SEQ ID NO: 595); 
ATCCGGAGTA (SEQ ID NO: 597); 
CCGGAGTACT (SEQ ID NO: 599) ; 
GGAGTACTTC (SEQ ID NO: 601); 
AGTACTTCAA (SEQ ID NO: 603) ; 
TACTTCAAGA (SEQ ID NO: 605); 
CTTCAAGAAC (SEQ ID NO: 607); 
TCAAGAACTG (SEQ ID NO: 609); 
AAGAACTGCT (SEQ ID NO: 611); 
GAACTGCTGA (SEQ ID NO: 613). 
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33. A method according to claim 27 wherein said deletion encompasses at least one 
of the following decanucleotides from the LTR region of HIV- 1 strain NL4-3: 
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GGGTCTCTCT 
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GGTCTCTCTG 
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GTCTCTCTGG 
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(SEQ 


ID 


NO: 686) ; 


AGATCTGAGC 


(SEQ 


ID 


NO:687) 


GATCTGAGCC 


(SEQ 


ID 


NO:688) ; 


ATCTGAGCCT 


(SEQ 


ID 


NO:689) 


TCTGAGCCTG 


(SEQ 


ID 


NO:690) ; 


CTGAGCCTGG 


(SEQ 


ID 


NO:691) 


TGAGCCTGGG 
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TCTGGTAACT (SEQ ID NO: 7 96); CTGGTAACTA (SEQ ID NO: 797); 
TGGTAACTAG ( SEQ ID NO: 798) ; GGTAACTAGA ( SEQ ID NO : 7 9 9 ) . 

34. A method according to claim 24 wherein said HTV-1 isolate has the identifying 
characteristics of HIV-1 isolate ECACC V94101706 or HTV-1 isolate ECACC 
V941031 169 or isolate C54 deposited at ECACC on 14 February, 1995 under Accession 
Number . 

35. A method according to claim 20 wherein the HIV-1 isolate is reactive to 
antibodies to a glycoprotein of HIV-1, is capable of inducing an immune response of 
gag, pol and/or ertv and which is incapable of directing synthesis of a we/gene product 
or a full length mf gene product 

36. A method according to claim 35 wherein the glycoprotein is at least one of gp41- 
45, gpl20 and/or gpl60. 

37. A method for vaccinating an individual against the development of AIDS or 
AIDS related diseases, said method comprising administering to said individual a non- 
pathogenic isolate of HIV-1 in an amount effective to infect target cells and to generate 
target cells carrying DNA derived from said non-pathogenic HIV-1. 

38. A method according to claim 37 wherein said isolate is capable of stimulating 
an immune response in a human or primate against at least one glycoprotein on HIV-1 
whilst not substantially reducing proliferative responses and cytokine production to a 
mitogen in said human or primate compared to a healthy, non-infected human or primate 
subject 

39. A method according to claim 38 wherein said isolate carries one or more 
mutations in its genome resulting in the inability to direct synthesis of at least one 
polypeptide or protein from a pathogenic strain of HTV-1. 
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40. A method according to claim 17 wherein said isolate carried one or more 
mutations- in its genome resulting in said genome directing synthesis of a truncated form 
of a polypeptide or protein from a pathogenic strain of HIV-1. 

41. A method according to claim 39 or 40 wherein said HTV-1 isolate carries a 
mutation in its nef gene and/or long terminal repeat (LTR) region or in a functionally 
equivalent location in the HIV-1 genome. 

42. A method according to claim 40 wherein said HIV-1 isolate is reactive to 
antibodies to gp41-45, gpl20 and/or gpl60 of HIV-1 and carries a deletion of at least 
ten contiguous nucleotides in a region corresponding to the nef gene and/or LTR region 
of pathogenic HIV-1. 

43. A method according to claim 42 wherein said HTV-1 isolate is capable of 
inducing an immune response to at least one of gag, pol and/or em. 

44. A method according to claim 43 wherein said HIV-1 isolate carries a deletion in 
its genome of at least 10 nucleotides within the region from nucleotide 8787 to 
nucleotide 9709 using the nucleotide numbering of HIV-1 strain NL4-3. 

45. A method according to claim 44 wherein said isolate carries a deletion in its 
genome of at least 10 nucleotides within a region selected from the list consisting of: 
nucleotide (i) 8830-8862; 



46. A method according to claim 44 wherein said isolate carries a deletion in its 
genome of at least 10 nucleotides within a region selected from the list consisting of: 



(") 

(iii) 
(iv) 



9009-9035; 
9019-9029; and 



9033-9049. 



nucleotide 



(v) 
(vi) 



9281-9371; 
9281-9362; 
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(vii) 9105-9224; and 

(viii) 9271-9370. 



47. A method according to claim 44 wherein said isolate carries a deletion in its 
genome of at least 10 nucleotides within a region selected from the list consisting of: 
nucleotide (ix) 8882-8928; 

(x) 8850-9006; 

(xi) 8792-9041; and 

(xii) 9112-9204. 



47. A method according to claim 44 wherein said isolate carries a deletion in its 
genome of at least 10 nucleotides within a region selected from the list consisting of: 
nucleotide (xiii) 9 1 05-9224; 

(xiv) 9389-9395; and 

(xv) 9281-9366. 



48. A method according to claim 44 wherein said deletion encompasses at least one 
of the following decanucleotides from the nef gene of HIV-1 strain NL43: 



ATGGGTGGCA (SEQ ID NO: 2) ; 
GGGTGGCAAG (SEQ ID NO: 4) ; 
GTGGCAAGTG (SEQ ID NO: 6); 
GGCAAGTGGT (SEQ ID NO: 8); 
CAAGTGGTCA (SEQ ID NO: 10) 
AGTGGTCAAA (SEQ ID NO: 12) 
TGGTCAAAAA (SEQ ID NO: 14) 
GTCAAAAAGT (SEQ ID NO: 16) 
CAAAAAGTAG (SEQ ID NO: 18) 
AAAAGTAGTG (SEQ ID NO: 20) 
AAGTAGTGTG (SEQ ID NO: 22) 
GTAGTGTGAT (SEQ ID NO: 24) 
AGTGTGATTG (SEQ ID NO: 26) 
TGTGATTGGA (SEQ ID NO: 28) 
TGATTGGATG (SEQ ID NO: 30) 
ATTGGATGGC (SEQ ID NO: 32) 
TGGATGGCCT (SEQ ID NO: 34) 



TGGGTGGCAA (SEQ ID NO:3) ; 

GGTGGCAAGT (SEQ ID NO: 5); 

TGGCAAGTGG (SEQ ID NO: 7) ; 

GCAAGTGGTC (SEQ ID NO: 9); 

AAGTGGTCAA (SEQ ID NO: 11) 

GTGGTCAAAA (SEQ ID NO: 13) 

GGTCAAAAAG (SEQ ID NO: 15) 

TCAAAAAGTA (SEQ ID NO: 17) 

AAAAAGTAGT (SEQ ID NO: 19); 

AAAGTAGTGT ( SEQ ID NO : 2 1 ) ; 

AGTAGTGTGA ( SEQ ID NO : 2 3 ) ; 

TAGTGTGATT (SEQ ID NO: 25) ; 

GTGTGATTGG (SEQ ID NO: 27) 

GTGATTGGAT (SEQ ID NO: 29) 

GATTGGATGG (SEQ ID NO: 31) 

TTGGATGGCC (SEQ ID NO: 33) 

GGATGGCCTG (SEQ ID NO: 35) 
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GATGGCCTGC (SEQ ID NO: 36) 
TGGCCTGCTG (SEQ ID NO: 38) 
GCCTGCTGTA (SEQ ID NO: 40) 
CTGCTGTAAG (SEQ ID NO: 42) 
GCTGTAAGGG (SEQ ID NO: 44) 
TGTAAGGGAA (SEQ ID NO: 46) 
TAAGGGAAAG (SEQ ID NO:48) 
AGGGAAAGAA (SEQ ID NO: 50) 
GGAAAGAATG (SEQ ID NO:52) 
AAAGAATGAG (SEQ ID NO: 54) 
AGAATGAGAC (SEQ ID NO: 56) 
AATGAGACGA (SEQ ID NO: 58) 
TGAGACGAGC (SEQ ID NO: 60) 
AGACGAGCTG (SEQ ID NO: 62) 
ACGAGCTGAG (SEQ ID NO: 64) 
GAGCTGAGCC (SEQ ID NO: 66) 
GCTGAGCCAG (SEQ ID NO: 68) 
TGAGCCAGCA (SEQ ID NO: 70) 
AGCCAGCAGC (SEQ ID NO: 72) 
CCAGCAGCAG (SEQ ID NO: 74) , 
AGCAGCAGAT (SEQ ID NO: 76) ; 
CAGCAGATGG (SEQ ID NO:78) J 
GCAGATGGGG (SEQ ID NO: 80); 
AGATGGGGTG (SEQ ID NO: 82); 
ATGGGGTGGG (SEQ ID NO: 84); 
GGGGTGGGAG (SEQ ID NO: 86); 
GGTGGGAGCA (SEQ ID NO: 88); 
TGGGAGCAGT (SEQ ID NO: 90) ; 
GGAGCAGTAT (SEQ ID NO: 92); 
AGCAGTATCT (SEQ ID NO: 94); 
CAGTATCTCG (SEQ ID NO: 96) ; 
GTATCTCGAG (SEQ ID NO: 98); 
ATCTCGAGAC (SEQ ID NO: 100) 
CTCGAGACCT (SEQ ID NO: 102) 
CGAGACCTAG (SEQ ID NO: 104) 
AGACCTAGAA (SEQ ID NO: 106) 
ACCTAGAAAA (SEQ ID NO: 108) 
CTAGAAAAAC (SEQ ID NO: 110) 



-214- 



ATGGCCTGCT 


(SEQ 


ID 


NO 


:37) 


GGCCTGCTGT 


(SEQ 


ID 


NO 


:39) 


CCTGCTGTAA 


(SEQ 


ID 


NO 


:41) 


TGCTGTAAGG 


(SEQ 


ID 


NO 


:43) 


CTGTAAGGGA 


(SEQ 


ID 


NO 


:45) 


GTAAGGGAAA 


(SEQ 


ID 


NO 


:47) 


AAGGGAAAGA 


(SEQ 


ID 


NO 


:49) 


GGGAAAGAAT 


(SEQ 


ID 


NO 


:51) 


GAAAGAATGA 


(SEQ 


ID 


NO 


:53) 


AAGAATGAGA 


(SEQ 


ID 


NO 


:55) 


GAATGAGACG 


(SEQ 


ID 


NO 


:57) 


ATGAGACGAG 


(SEQ 


ID 


NO 


:59) 


GAGACGAGCT 


(SEQ 


ID 


NO 


:61) 


GACGAGCTGA 


(SEQ 


ID 


NO: 


63) 


CGAGCTGAGC 


(SEQ 


ID 


NO: 


65) 


AGCTGAGCCA 


(SEQ 


ID 


NO: 


67) 


CTGAGCCAGC 


(SEQ 


ID 


NO: 


69) 


GAGCCAGCAG 


(SEQ 


ID 


NO: 


71) 


GCCAGCAGCA 


(SEQ 


ID 


NO: 


73) 


CAGCAGCAGA 


(SEQ 


ID 


NO: 


75) 


GCAGCAGATG 


(SEQ 


ID 


NO: 


77) 


AGCAGATGGG 


(SEQ 


ID 


NO: 


79) 


CAGATGGGGT 


(SEQ 


ID 


NO: 


81) 


GATGGGGTGG 


(SEQ 


ID 


NO: 


83) 


TGGGGTGGGA 


(SEQ 


ID 


NO: 


85) 


GGGTGGGAGC 


(SEQ 


ID 


NO: 


87) 


GTGGGAGCAG 


(SEQ 


ID 


NO: 


89) 


GGGAGCAGTA 


(SEQ 


ID 


NO: 


91) 


GAGCAGTATC 


(SEQ 


ID 


NO: 


93) , 


GCAGTATCTC 


(SEQ 


ID 


NO: 


95), 


AGTATCTCGA 


(SEQ 


ID 


NO: 


97) , 


TATCTCGAGA 


(SEQ 


ID 


NO: 


99) j 



TCTCGAGACC (SEQ ID NO: 101) 

TCGAGACCTA (SEQ ID NO: 103); 

GAGACCTAGA (SEQ ID NO : 105 ) ; 

GACCTAGAAA (SEQ ID NO:107) ; 

CCTAGAAAAA (SEQ ID NO: 109) ; 

TAGAAAAACA (SEQ ID NO: 111); 
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AGAAAAACAT (SEQ ID NO: 112 
AAAAACATGG (SEQ ID NO: 114 
AAACATGGAG (SEQ ID NO: 116 
ACATGGAGCA (SEQ ID NO: 118 
ATGGAGCAAT (SEQ ID NO: 120 
GGAGCAATCA (SEQ ID NO: 122 
AGCAATCACA (SEQ ID NO: 124 
CAATCACAAG (SEQ ID NO: 126 
ATCACAAGTA (SEQ ID NO: 128! 
CACAAGTAGC (SEQ ID NO: 130! 
CAAGTAGCAA (SEQ ID NO: 132! 
AGTAGCAATA (SEQ ID NO: 134! 
TAGCAATACA (SEQ ID NO: 136] 
GCAATACAGC (SEQ ID NO: 13 8] 
AATACAGCAG (SEQ ID NO: 140] 
TACAGCAGCT (SEQ ID NO: 142] 
CAGCAGCTAA (SEQ ID NO: 144) 
GCAGCTAACA (SEQ ID NO: 14 6) 
AGCTAACAAT (SEQ ID NO: 14 8) 
CTAACAATGC (SEQ ID NO: 150) 
AACAATGCTG (SEQ ID NO: 152) 
CAATGCTGCT (SEQ ID NO: 154) 
ATGCTGCTTG (SEQ ID NO: 156) 
GCTGCTTGTG (SEQ ID NO: 158) 
TGCTTGTGCC (SEQ ID NO: 160) 
CTTGTGCCTG (SEQ ID NO: 162) 
TGTGCCTGGC (SEQ ID NO: 164) 
TGCCTGGCTA (SEQ ID NO: 166) 
CCTGGCTAGA (SEQ ID NO: 168) 
TGGCTAGAAG (SEQ ID NO: 170) 
GCTAGAAGCA (SEQ ID NO: 172) 
TAGAAGCACA (SEQ ID NO: 174) 
GAAGCACAAG (SEQ ID NO: 176) 
AGCACAAGAG (SEQ ID NO: 178) 
CACAAGAGGA (SEQ ID NO: 180) 
CAAGAGGAGG (SEQ ID NO: 182) 
AGAGGAGGAA (SEQ ID NO: 184) 
AGGAGGAAGA (SEQ ID NO: 186) 
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GAAAAACATG (SEQ ID NO: 113); 
AAAACATGGA (SEQ ID NO: 115); 
AACATGGAGC (SEQ ID NO : 117) .; 
CATGGAGCAA (SEQ ID NO:119) ; 
TGGAGCAATC (SEQ ID NO: 121); 
GAGCAATCAC (SEQ ID NO: 123); 
GCAATCACAA (SEQ ID NO:125) ; 
AATCACAAGT (SEQ ID NO: 127) ; 
TCACAAGTAG (SEQ ID NO:129) ; 
ACAAGTAGCA (SEQ ID NO: 131) ; 
AAGTAGCAAT (SEQ ID NO: 133) ; 
GTAGCAATAC (SEQ ID NO: 135) ; 
AGCAATACAG (SEQ ID NO: 137); 
CAATACAGCA (SEQ ID NO: 139); 
ATACAGCAGC (SEQ ID NO: 141) ; 
ACAGCAGCTA "(SEQ ID NO: 143); 
AGCAGGTAAC (SEQ ID NO: 145) ; 
CAGCTAACAA (SEQ ID NO: 147) ; 
GCTAACAATG (SEQ ID NO: 149); 
• TAACAATGCT (SEQ ID NO:151); 
ACAATGCTGC (SEQ ID NO: 153); 
AATGCTGCTT (SEQ ID NO: 155); 
TGCTGCTTGT (SEQ ID NO: 157); 
CTGCTTGTGC (SEQ ID NO: 159); 
GCTTGTGCCT (SEQ ID NO: 161); 
TTGTGCCTGG (SEQ ID NO:163); 
GTGCCTGGCT (SEQ ID NO:165); 
GCCTGGCTAG (SEQ ID NO:167); 
CTGGCTAGAA (SEQ ID NO:169) ; 
GGCTAGAAGC (SEQ ID NO: 171); 
CTAGAAGCAC (SEQ ID NO:173); 
AGAAGCACAA (SEQ ID NO:175); 
AAGCACAAGA (SEQ ID NO:177) ; 
GCACAAGAGG (SEQ ID NO:179); 
ACAAGAGGAG (SEQ ID NO:181); 
AAGAGGAGGA (SEQ ID NO:183); 
GAGGAGGAAG (SEQ ID NO: 185) ; 
GGAGGAAGAG (SEQ ID NO:187); 
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GAGGAAGAGG (SEQ ID NO: 188) 
GGAAGAGGTG (SEQ ID NO: 190} 
AAGAGGTGGG (SEQ ID NO: 192) 
GAGGTGGGTT (SEQ ID NO: 194) 
GGTGGGTTTT (SEQ ID NO: 196) 
TGGGTTTTCC (SEQ ID NO: 198) 
GGTTTTCCAG (SEQ ID NO: 200) 
TTTTCCAGTC (SEQ ID NO: 202) 
TTCCAGTCAC (SEQ ID NO: 204) 
CCAGTCACAC (SEQ ID NO: 206) 
AGTCACACCT (SEQ ID NO: 208) 
TCACACCTCA (SEQ ID NO: 210) 

ACACCTCAGG (SEQ ID NO: 212) 

ACCTCAGGTA (SEQ ID NO: 214) 
.. CTCAGGTACC (SEQ ID NO.-216) 
. CAGGTACCTT (SEQ ID NO: 218) 
: GGTACGTTTA ( SEQ ID NO : 22 0 ) 

. TACCTTTAAG (SEQ ID NO:222), 

GCTTTAAGAC (SEQ ID NO: 224) , 

TTTAAGACCA (SEQ ID NO:226) j 
- . . TAAGAGCAAT (SEQ ID NO: 228) ; 

AGACCAATGA (SEQ ID NO: 230) ; 

ACCAATGACT (SEQ ID NO: 232) j 

CAATGACTTA (SEQ ID NO: 234) ; 

ATGACTTACA (SEQ ID NO: 23 6) ; 

GACTTACAAG (SEQ ID NO: 238) ; 

CTTACAAGGC (SEQ ID NO: 240) ; 

TACAAGGCAG (SEQ ID NO: 242) ; 

CAAGGCAGCT (SEQ ID NO: 244) ; 

AGGCAGCTGT (SEQ ID NO: 24 6) ; 

GCAGCTGTAG (SEQ ID NO: 248) ; 

AGCTGTAGAT (SEQ ID NO: 250) ; 

CTGTAGATCT (SEQ ID NO: 2 52); 

GTAGATCTTA (SEQ ID NO: 254) ; 

AGATCTTAGC (SEQ ID NO: 256) ; 

ATCTTAGCCA (SEQ ID NO: 258) ; 

CTTAGCCACT (SEQ ID NO: 2 60); 

TAGCCACTTT (SEQ ID NO:262); 
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AGGAAGAGGT (SEQ ID NO:189) ; 

GAAGAGGTGG (SEQ ID NO; 191); 

AGAGGTGGGT (SEQ ID NO: 193); 

AGGTGGGTTT (SEQ ID NO:195) ; 

GTGGGTTTTC (SEQ ID NO: 197) ; 

GGGTTTTCCA (SEQ ID NO: 199).; 

GTTTTCCAGT (SEQ ID NO: 201) ; 

TTTCCAGTCA (SEQ ID NO: 2 03) ; 

TCCAGTCACA (SEQ ID NO:205) ; 

CAGTCACACC (SEQ ID NO:207> ; 

GTCACACCTC (SEQ ID NO: 209) ; 

CACACCTCAG (SEQ ID NO: 211) ; 

CACCTCAGGT (SEQ ID NO:213); 

CCTCAGGTAC (SEQ ID NO: 215) ; 

TCAGGTACCT (SEQ ID NO: 217) ; 

AGGTACCTTT (SEQ ID NO: 219) ; 

GTACCTTTAA (SEQ ID NO: 221) ; 

ACCTTTAAGA (SEQ ID NO: 223).; 

CTTTAAGACC (SEQ ID NO:225); 

TTAAGACCAA (SEQ ID NO:227);, 
AAGACCAATG (SEQ ID NO: 229) ; 
GACCAATGAC (SEQ ID NO: 231) ; 
CCAATGACTT (SEQ ID NO: 233) ; 
AATGACTTAC (SEQ ID NO: 235) ; 
TGACTTACAA (SEQ ID NO: 237) ; 
ACTTACAAGG (SEQ ID NO:239); 
TTACAAGGCA (SEQ ID NO:241); 
ACAAGGCAGC (SEQ ID NO: 243) ; 
AAG6CAGCTG (SEQ ID NO: 24 5) ; 
GGCAGCTGTA (SEQ ID NO: 247) ; 
CAGCTGTAGA (SEQ ID NO: 249) ; 
GCTGTAGATC (SEQ ID NO: 251) ; 
TGTAGATCTT (SEQ ID NO: 2 53) ; 
TAGATCTTAG (SEQ ID NO: 255) ; 
GATCTTAGCC (SEQ ID NO: 257) ; 
TCTTAGCCAC (SEQ ID NO: 259) ; 
TTAGCCACTT (SEQ ID NO:261); 
AGCCACTTTT (SEQ ID NO: 263) ; 



WO 95/21912 



PCT/AU95/00063 



GCCACTTTTT (SEQ ID NO: 2 64) 
CACTTTTTAA (SEQ ID NO: 266) 
CTTTTTAAAA (SEQ ID NO: 268) 
TTTTAAAAGA (SEQ ID NO: 270) 
TTAAAAGAAA (SEQ ID NO: 272) 
AAAAGAAAAG (SEQ ID NO: 274) 
AAGAAAAGGG (SEQ ID NO: 2 76) 
GAAAAGGGGG (SEQ ID NO: 278) 
AAAGGGGGGA (SEQ ID NO: 280) 
AGGGGGGACT (SEQ ID NO: 2 82) 
GGGGGACTGG (SEQ ID NO: 2 84) 
GGGACTGGAA (SEQ ID NO: 286) 
GACTGGAAGG (SEQ ID NO: 288) 
CTGGAAGGGC (SEQ ID NO: 290) 
GGAAGGGCTA (SEQ ID NO: 292) 
AAGGGCTAAT ( SEQ ID NO : 294 ) 
GGGCTAATTC (SEQ ID NO: 296) 
GCTAATTCAC (SEQ ID NO: 298) 
TAATTCACTC (SEQ ID NO: 3 00) 
ATTCACTCCC (SEQ ID NO : 302 ) 
TCACTCCCAA (SEQ ID NO: 3 04) 
ACTCCCAAAG (SEQ ID NO: 306) 
TCCCAAAGAA (SEQ ID NO: 3 08) 
CCAAAGAAGA (SEQ ID NO:310) 
AAAGAAGACA (SEQ ID NO: 312) , 
AGAAGACAAG (SEQ ID NO: 314) t 
AAGACAAGAT (SEQ ID NO:316); 
GACAAGATAT (SEQ ID NO: 3 18) ; 
CAAGATATCC (SEQ ID NO: 320) ; 
AGATATCCTT (SEQ ID NO: 322) j 
ATATCCTTGA (SEQ ID NO: 324) ; 
ATCCTTGATC (SEQ ID NO: 326) j 
CCTTGATCTG ( SEQ ID NO : 3 2 8 ) ; 
TTGATCTGTG (SEQ ID NO: 330) ; 
GATCTGTGGA (SEQ ID NO: 332) ; 
TCTGTGGATC ( SEQ ID NO : 3 34 ) ; 
TGTGGATCTA (SEQ ID NO: 336) ; 
TGGATCTACC (SEQ ID NO:338) ; 



-217- 

CCACTTTTTA (SEQ ID NO: 265) 
ACTTTTTAAA (SEQ ID NO: 267) 

. TTTTTAAAAG (SEQ ID NO: 269) 
TTTAAAAGAA (SEQ ID NO: 271) 
TAAAAGAAAA (SEQ ID NO: 273) 
AAAGAAAAGG (SEQ ID NO:275) 
AGAAAAGGGG (SEQ ID NO:277) 
AAAAGGGGGG (SEQ ID NO: 279) 
AAGGGGGGAC (SEQ ID NO:281) 
GGGGGGACTG (SEQ ID NO: 283) 
GGGGACTGGA (SEQ ID NO: 285) 
GGACTGGAAG (SEQ ID NO: 287) 
ACTGGAAGGG (SEQ- ID NO:289) 
TGGAAGGGCT (SEQ ID NO: 291) 
GAAGGGCTAA (SEQ ID NO: 293) 
AGGGCTAATT (SEQ ID NO:295) 
GGCTAATTCA (SEQ ID NO: 297) 
CTAATTCACT (SEQ ID NO:299) 
AATTCACTCC (SEQ ID NO:301) 
TTCACTCCCA (SEQ ID NO:303), 
CACTCCCAAA (SEQ ID NO: 3 05), 
CTCCCAAAGA (SEQ ID NO:307) J 
CCCAAAGAAG (SEQ ID NO:309); 
CAAAGAAGAC (SEQ ID NO:311) ; 

AAGAAGACAA (SEQ ID NO: 313) j 

GAAGACAAGA (SEQ ID NO: 3 15) j 

AGACAAGATA (SEQ ID NO: 317); 

ACAAGATATC (SEQ ID NO: 319) ; 

AAGATATCCT (SEQ ID NO: 321) ; 

GATATCCTTG (SEQ ID NO: 323) ; 

TATCCTTGAT (SEQ ID NO: 325) ; 

TCCTTGATCT (SEQ ID NO: 3 27) ; 

CTTGATCTGT (SEQ ID NO: 329) ; 

TGATCTGTGG (SEQ ID NO: 331) ; 

ATCTGTGGAT ( SEQ ID NO : 3 3 3 ) ; 

CTGTGGATCT ( SEQ ID NO : 3 3 5 ) ; 

GTGGATCTAC (SEQ ID NO: 337) j 

GGATCTACCA (SEQ ID NO:339) ; 
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GATCTACCAC ( SEQ ID NO : 34 0 ) ; 

TCTACCACAC (SEQ ID NO: 342); 

TACCACACAC (SEQ ID NO:344) ; 

CCACACACAA (SEQ ID NO:346); 

ACACACAAGG (SEQ ID NO: 348) ; 

ACACAAGGCT (SEQ ID NO:350) ; 

ACAAGGCTAC { SEQ ID NO : 3 5 2 ) ; 

AAGGCTACTT (SEQ ID NO: 3 54) ; 

GGCTACTTCC (SEQ ID NO: 356) ; 

CTACTTCCCT ( SEQ ID NO : 3 5 8 ) ; 

ACTTCCCTGA (SEQ ID NO: 360); 

TTCCCTGATT ( SEQ ID NO : 3 62 ) ; 

CCCTGATTGG (SEQ ID NO: 3 64) ; 

CTGATTGGCA (SEQ ID NO: 366) ; 

GATTGGCAGA (SEQ ID NO: 368) ; 

TTGGCAGAAC (SEQ ID NO: 3 70) ; 

GGCAGAACTA (SEQ ID NO:372) ; 

GAGAACTACA ( SEQ ID NO:374) ; 

GAACTACACA (SEQ ID NO: 3 76) ; 
ACTACACACC (SEQ ID NO: 378) ; 
TACACACCAG (SEQ ID NO: 380) ; 

CACACCAGGG (SEQ ID NO: 382) ; 
CACCAGGGCC (SEQ ID NO: 3 84) ; 
CCAGGGCCAG (SEQ ID NO:386) ; 
AGGGCCAGGG (SEQ ID NO:388) ; 
GGCCAGGGGT (SEQ ID NO: 3 90) ; 
CCAGGGGTCA (SEQ ID NO:392); 
AGGGGTCAGA (SEQ ID NO: 3 94) ; 
GGGTCAGATA (SEQ ID NO: 396) ; 
GTCAGATATC (SEQ ID NO: 398) ; 
CAGATATCCA (SEQ ID NO:400) ; 
GATATCCACT (SEQ ID NO: 4 02) ; 
TATCCACTGA (SEQ ID NO: 4 04) ; 
TCCACTGACC (SEQ ID NO: 4 06) ; 
CACTGACCTT (SEQ ID NO: 4 08) ; 
CTGACCTTTG ( SEQ ID NO : 4 1 0 ) ; 
GACCTTTGGA (SEQ ID NO: 412) ; 
CCTTTGGATG (SEQ ID NO:414); 
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ATCTACCACA ( SEQ ID NO : 3 4 1 ) ; 

CTACCACACA (SEQ ID NO:343); 

ACCACACACA (SEQ ID NO: 345) ; 

CACACACAAG (SEQ ID NO:347) ; 

CACACAAGGC (SEQ ID NO: 349) ; 

CACAAGGCTA (SEQ ID NO: 351) ; 

CAAGGCTACT (SEQ ID NO:353) ; 

AGGCTACTTC (SEQ ID NO: 355) ; 

GCTACTTCCC (SEQ ID NO: 357) ; 

TACTTCCCTG (SEQ ID NO: 359) ; 

CTTCCCTGAT (SEQ ID NO: 361) ; 

TCCCTGATTG (SEQ ID NO: 363) ; 

CCTGATTGGC (SEQ ID NO: 365) ; 

TGATTGGCAG (SEQ ID NO:367) ; 
ATTGGCAGAA (SEQ ID NO: 369) ; 
TGGCAGAACT (SEQ ID NO: 371) ; 
GCAGAACTAC (SEQ ID NO: 3 73) 
AGAACTACAC (SEQ ID NO: 3 75) ; 
AACTACACAC (SEQ ID NO: 377) ; 
CTACACACCA (SEQ ID NO: 379) ; 
ACACACCAGG (SEQ ID NO:381) ; ; 
ACACCAGGGC (SEQ ID NO: 383) ; 
ACCAGGGCCA (SEQ ID NO:385) ; 
CAGGGCCAGG (SEQ ID NO: 3 87) ; 
GGGCCAGGGG (SEQ ID NO:389) ; 
GCCAGGGGTC (SEQ ID NO:391) ; 
CAGGGGTCAG (SEQ ID NO: 3 93) ; 
GGGGTCAGAT (SEQ ID NO:395) ; 
GGTCAGATAT (SEQ ID NO:397) ; 
TCAGATATCC (SEQ ID NO:399) ; 
AGATATCCAC (SEQ ID NO: 4 01) ; 
ATATCCACTG (SEQ ID NO: 4 03) ; 
ATCCACTGAC (SEQ ID NO:405) ; 
CCACTGACCT (SEQ ID NO: 407) ; 
ACTGACCTTT (SEQ ID NO: 409) ; 
TGACCTTTGG ( SEQ ID NO : 4 1 1 ) ; 
ACCTTTGGAT (SEQ ID NO: 4 13) ; 
CTTTGGATGG (SEQ ID NO: 415) ; 
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TTTGGATGGT (SEQ ID NO: 4 16 
TGGATGGTGC (SEQ ID NO: 418 
GATGGTGCTA (SEQ ID NO: 420 
TGGTGCTACA (SEQ ID NO: 422 
GTGCTACAAG (SEQ ID NO: 424 
GCTACAAGCT (SEQ ID NO:426 
TACAAGCTAG (SEQ ID NO: 428 
CAAGCTAGTA (SEQ ID NO: 430! 
AGCTAGTACC (SEQ ID NO: 432] 
CTAGTACCAG (SEQ ID NO: 434] 
AGTACCAGTT (SEQ ID NO: 436] 
TACCAGTTGA (SEQ ID NO: 438] 
CCAGTTGAGC (SEQ ID NO: 44 01 
AGTTGAGCCA (SEQ ID NO: 442] 
TTGAGCCAGA (SEQ ID NO: 444] 
GAGCCAGATA (SEQ ID NO: 446] 
GCCAGATAAG (SEQ ID NO: 448] 
CAGATAAGGT (SEQ ID NO: 450] 
GATAAGGTAG (SEQ ID NO: 4 52] 
TAAGGTAGAA (SEQ ID NO: 4 54] 
AGGTAGAAGA (SEQ ID NO:456) 
GTAGAAGAGG (SEQ ID NO: 458) 
AGAAGAGGCC (SEQ ID NO: 460) 
AAGAGGCCAA (SEQ ID NO: 462) 
GAGGCCAATA (SEQ ID NO: 464) 
GGCCAATAAA (SEQ ID NO: 466) 
CCAATAAAGG (SEQ ID NO:468) 
AATAAAGGAG (SEQ ID NO: 470) 
TAAAGGAGAG (SEQ ID NO: 472) 
AAGGAGAGAA (SEQ ID NO: 474) 
GGAGAGAACA (SEQ ID NO: 476) 
AGAGAACACC (SEQ ID NO:478) 
AGAACACCAG (SEQ ID NO:480) 
AACACCAGCT (SEQ ID NO: 482) 
CACCAGCTTG (SEQ ID NO: 484) 
CCAGCTTGTT (SEQ ID NO: 486) 
AGCTTGTTAC (SEQ ID NO: 488) 
CTTGTTACAC (SEQ ID NO: 490) 
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TTGGATGGTG 


(SEQ 


ID 


NO 


:417) ; 


GGATGGTGCT 


(SEQ 


ID 


NO 


:419) ; 


ATGGTGCTAC 


(SEQ 


ID 


NO 


:421) ; 


GGTGCTACAA 


(SEQ 


ID 


NO 


:423) ; 


TGCTACAAGC 


(SEQ 


ID 


NO 


:425) ; 


CTACAAGCTA 


(SEQ 


ID 


NO 


:427) ; 


ACAAGCTAGT 


(SEQ 


ID 


NO 


:429) ; 


AAGCTAGTAC 


(SEQ 


ID 


NO 


:431) ; 


GCTAGTACCA 


(SEQ 


ID 


NO 


:433) ; 


TAGTACCAGT 


(SEQ 


ID 


NO 


:435) ; 


GTACCAGTTG 


(SEQ 


ID 


NO 


:437) ; 


ACCAGTTGAG 


(SEQ 


ID 


NO. 


:439) ; 


CAGTTGAGCC 


(SEQ 


ID 


NO: 


:441) ; 


GTTGAGCCAG 


(SEQ 


ID 


NO: 


-443) ; 


TGAGCCAGAT 


(SEQ 


ID 


NO: 


445) ; 


AGCCAGATAA 


(SEQ 


ID 


NO: 


447) ; 


CCAGATAAGG 


(SEQ 


ID 


NO: 


449) ; 


AGATAAGGTA 


(SEQ 


ID 


NO: 


451) ; 


ATAAGGTAGA 


(SEQ 


ID 


NO: 


453) ; 


AAGGTAGAAG 


(SEQ 


ID 


NO: 


455) ; 


GGTAGAAGAG 


(SEQ 


ID 


NO: 


457) ; 


TAGAAGAGGC 


(SEQ 


ID 


NO: 


459) ; 


GAAGAGGCCA 


(SEQ 


ID 


NO: 


461) ; 


AGAGGCCAAT 


(SEQ 


ID 


NO: 


463) ; 


AGGCCAATAA 


(SEQ 


ID 


NO: 


465) ; 


GCCAATAAAG 


(SEQ 


ID 


NO: 


467) ; 


CAATAAAGGA 


(SEQ 


ID 


NO: 


469) ; 


ATAAAGGAGA 


(SEQ 


ID 


NO: 


471) ; 


AAAGGAGAGA 


(SEQ 


ID 


NO: 


473) ; 


AGGAGAGAAC 


(SEQ 


ID 


NO: 


475) ; 


GAGAGAACAC 


(SEQ 


ID 


NO: 


477) ; 


GAGAACACCA 


(SEQ 


ID 


NO: 


479) ; 


GAACACCAGC 


(SEQ 


ID 


NO: 


481) ; 


ACACCAGCTT 


(SEQ 


ID 


NO: 


483) ; 


ACCAGCTTGT 


(SEQ 


ID 


NO: 


485) ; 


CAGCTTGTTA 


(SEQ 


ID 


NO: 


487) ; 


GCTTGTTACA 


(SEQ 


ID 


NO: 


489) ; 


TTGTTACACC 


(SEQ 


ID 


NO: 


491) ; 
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TGTTACACCC (SEQ ID NO: 4 92) ; 
TTACACCCTG (SEQ ID NO: 4 94); 
ACACCCTGTG (SEQ ID NO: 496) ; 
ACCCTGTGAG (SEQ ID NO: 498) ; 
CCTGTGAGCC (SEQ ID NO: 500) ; 
TGTGAGCCTG (SEQ ID NO:502); 
TGAGCCTGCA (SEQ ID NO: 504) ; 

AGCCTGCATG (SEQ ID NO: 506); 

CCTGCATGGA (SEQ ID NO: 5 08); 

TGCATGGAAT (SEQ ID NO:510) ; 

CATGGAATGG (SEQ ID NO:512) ; 

TGGAATGGAT (SEQ ID NO: 514) ; 

GAATGGATGA (SEQ ID NO: 516) ; 

ATGGATGACC (SEQ ID NO:518); 

GGATGACCCT (SEQ ID NO:520); 

ATGACCCTGA (SEQ ID NO:522); 

GACCCTGAGA (SEQ ID NO: 524) ; 

CCCTGAGAGA (SEQ ID NO: 526) ; 

CTGAGAGAGA (SEQ ID NO: 528) ; 

GAGAGAGAAG (SEQ ID NO: 530) ; 

GAGAGAAGTG (SEQ ID NO: 532) ; 

GAGAAGTGTT (SEQ ID NO: 534); 

GAAGTGTTAG (SEQ ID NO: 53 6); 
AGTGTTAGAG (SEQ ID NO: 538); 
TGTTAGAGTG (SEQ ID NO: 540); 
TTAGAGTGGA (SEQ ID NO: 542); 
AGAGTGGAGG (SEQ ID NO: 544) ; 
AGTGGAGGTT • (SEQ ID NO:546) ; 
TGGAGGTTTG ( SEQ ID NO : 54 8 ) ; 
GAGGTTTGAC (SEQ ID NO: 550); 
GGTTTGACAG (SEQ ID NO: 552) ; 
TTTGACAGCC (SEQ ID NO: 554) ; 
TGACAGCCGC (SEQ ID NO: 556); 
ACAGCCGCCT (SEQ ID NO:558); 
AGCCGCCTAG (SEQ ID NO:560); 
CCGCCTAGCA (SEQ ID NO:562); 
GCCTAGCATT (SEQ ID NO: 564) ; 
CTAGCATTTC (SEQ ID NO: 566) ; 



GTTACACCCT (SEQ ID NO:493) ; 
TACACCCTGT (SEQ ID NO:495); 
CACCCTGTGA (SEQ ID NO: 497) ; 
CCCTGTGAGC ( SEQ ID NO : 4 9 9 ) ; 
CTGTGAGCCT (SEQ ID NO:501); 
GTGAGCCTGC (SEQ ID NO: 503) ; 

GAGCCTGCAT (SEQ ID NO: 5 05) ; 

GCCTGCATGG (SEQ ID NO: 507) ; 

CTGCATGGAA (SEQ ID NO:509) ; 

GCATGGAATG (SEQ ID NO: 511) ; 

ATGGAATGGA (SEQ ID NO: 513) ; 

GGAATGGATG (SEQ ID NO:515); 

AATGGATGAC (SEQ ID NO: 517) ; 

TGGATGACCC (SEQ ID NO: 519) ; 

GATGACCCTG (SEQ ID NO: 521) ; 

TGACCCTGAG (SEQ ID NO: 523) ; 

ACCCTGAGAG (SEQ ID NO : 525 ) ; 

CCTGAGAGAG (SEQ ID NO: 527),; 

TGAGAGAGAA (SEQ ID NO:529) ; 
AGAGAGAAGT (SEQ ID NO: 531) ; 
AGAGAAGTGT (SEQ ID NO: 533) ; 
AGAAGTGTTA (SEQ ID NO: 535) ; 
AAGTGTTAGA (SEQ ID NO:537); 
GTGTTAGAGT (SEQ ID NO: 539) ; 
GTTAGAGTGG (SEQ ID NO: 541) ; 
TAGAGTGGAG (SEQ ID NO: 543) ; 
GAGTGGAGGT (SEQ ID NO: 545) ; 
GTGGAGGTTT (SEQ ID NO: 547) ; 
GGAGGTTTGA (SEQ ID NO: 549) ; 
AGGTTTGACA (SEQ ID NO: 551) ; 
GTTTGACAGC (SEQ ID NO:553); 
TTGACAGCCG (SEQ ID NO: 555) ; 
GACAGCCGCC (SEQ ID NO:557); 
CAGCCGCCTA (SEQ ID NO: 559); 
GCCGCCTAGC (SEQ ID NO: 561) ; 
CGCCTAGCAT (SEQ ID NO: 563) ; 
CCTAGCATTT (SEQ ID NO: 565) ; 
TAGCATTTCA (SEQ ID NO:567) ; 
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AGCATTTCAT (SEQ ID NO: 568) ; 

CATTTCATCA (SEQ ID NO:570); 

TTTCATCACG (SEQ ID NO: 572); 

TCATCACGTG (SEQ ID NO:574); 

ATCACGTGGC (SEQ ID NO: 576) ; 

CACGTGGCCC (SEQ ID NO: 578) ; 

CGTGGCCCGA (SEQ ID NO: 580); 

TGGCCCGAGA (SEQ ID NO: 582) ; 

GCCCGAGAGC (SEQ ID NO: 5 84); 

CCGAGAGCTG (SEQ ID NO: 586); 

GAGAGCTGCA (SEQ ID NO: 588); 

GAGCTGCATC (SEQ ID NO:590) ; 

GCTGCATCCG (SEQ ID NO: 592); 
TGCATCCGGA (SEQ ID NO: 594) ; 
CATCCGGAGT (SEQ ID NO: 596) ; 
TCCGGAGTAC (SEQ ID NO:598); 
CGGAGTACTT (SEQ ID NO:600) ; 
GAGTACTTCA (SEQ ID NO: 6 02) ; 
GTACTTCAAG (SEQ ID NO: 604) ; 
ACTTCAAGAA (SEQ. ID NO:606); 
TTCAAGAACT (SEQ ID NO: 608); 
CAAGAACTGC (SEQ ID NO: 610); 
AGAACTGCTG (SEQ ID NO: 612); 



GCATTTCATC (SEQ ID NO: 5 69); 

ATTTCATCAC (SEQ ID NO: 571); 

TTCATCACGT (SEQ ID NO: 5 73); 

CATCACGTGG (SEQ ID NO: 575); 

TCACGTGGCC (SEQ ID NO: 577); 

ACGTGGCCCG (SEQ ID NO:579); 

GTGGCCCGAG (SEQ ID NO:581); 

GGCCCGAGAG (SEQ ID NO:583); 

CCCGAGAGCT (SEQ ID NO: 585); 

CGAGAGCTGC (SEQ ID NO: 587) ; 
AGAGCTGCAT ( SEQ ID NO : 5 8 9 ) ; 
AGCTGCATCC (SEQ ID NO: 591) ; 
CTGCATCCGG (SEQ ID NO: 593) ; 
GCATCCGGAG (SEQ ID NO: 595) ; 
ATCCGGAGTA (SEQ ID NO: 597) ; 
CCGGAGTACT (SEQ ID NO: 599) ; 
GGAGTACTTC (SEQ ID NO:601); 
AGTACTTCAA (SEQ ID NO:603); 
TACTTCAAGA (SEQ ID NO:605); 
CTTCAAGAAC (SEQ ID NO:607); 
TCAAGAACTG (SEQ ID NO:609); 
AAGAACTGCT (SEQ ID NO:611) ; 
GAACTGCTGA (SEQ ID NO: 613). 



49. A method according to claim 44 wherein said deletion encompasses at least one 
of the following decanucleotides from the LTR region of HTV-1 strain NL4-3: 



GCTTTTTGCC 
TTTTTGCCTG 
TTTGCCTGTA 
TGCCTGTACT 
CCTGTACTGG 
TGTACTGGGT 
TACTGGGTCT 
CTGGGTCTCT 
GGGTCTCTCT 
GTCTCTCTGG 
CTCTCTGGTT 



(SEQ ID NO: 652) ; 
(SEQ ID NO: 654) ; 
(SEQ ID NO: 656) ; 
(SEQ ID NO: 658) ; 
(SEQ ID NO: 660) ; 
(SEQ ID NO: 662) ; 
(SEQ ID NO: 664) ; 
(SEQ ID NO: 666) ; 
(SEQ ID NO: 668) ; 
(SEQ ID NO: 670) ; 
(SEQ ID NO: 672) ; 



CTTTTTGCCT 
TTTTGCCTGT 
TTGCCTGTAC 
GCCTGTACTG 
CTGTACTGGG 
GTACTGGGTC 
ACTGGGTCTC 
TGGGTCTCTC 
GGTCTCTCTG 
TCTCTCTGGT 
TCTCTGGTTA 



(SEQ ID NO: 653) ; 
(SEQ ID NO: 655) ; 
(SEQ ID NO: 657) ; 
(SEQ ID NO: 659) ; 
(SEQ ID NO: 661) ; 
(SEQ ID NO: 663) ; 
(SEQ ID NO: 665) ; 
(SEQ ID NO: 667) ; 
(SEQ ID NO: 669) ; 
(SEQ ID NO: 671) ; 
(SEQ ID NO: 673) ; 
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CTCTGGTTAG 

CTGGTTAGAC 

GGTTAGACCA 

TTAGACCAGA 

AGACCAGATC 

ACCAGATCTG 

CAGATCTGAG 

GATCTGAGCC 

TCTGAGCCTG 

TGAGCCTGGG 

AGCCTGGGAG 

CCTGGGAGCT 

TGGGAGCTCT 

GGAGCTCTCT 

AGCTCTCTGG 

CTCTCTGGCT 

CTCTGGCTAA 

CTGGCTAACT 

GGCTAACTAG 

CTAACTAGGG 

AACTAGGGAA 

CTAGGGAACC 

AGGGAACCCA 

GGAACCCACT 

AACCCACTGC 

CCCACTGCTT 

CACTGCTTAA 

CTGCTTAAGC 

GCTTAAGCCT 

TTAAGCCTCA 

AAGCCTCAAT 

GCCTCAATAA 

CTCAATAAAG 

CAATAAAGCT 

ATAAAGCTTG 

AAAGCTTGCC 

AGCTTGCCTT 

CTTGCCTTGA 



(SEQ ID NO: 674! 
(SEQ ID NO: 676i 
(SEQ ID NO: 678] 
(SEQ ID NO: 680] 
(SEQ ID NO: 682) 
(SEQ ID NO: 684) 
(SEQ ID NO: 686) 
(SEQ ID NO: 688) 
(SEQ ID NO: 690) 
(SEQ ID NO: 692) 
(SEQ ID NO: 694) 
(SEQ ID NO: 696) 
(SEQ ID NO: 698) 
(SEQ ID NO:700) 
(SEQ ID NO: 702) 
(SEQ ID NO: 704) 
(SEQ ID NO:706) 
(SEQ ID NO: 708) 
(SEQ ID NO: 710) 
(SEQ ID NO: 712) 
(SEQ ID NO: 714) 
(SEQ ID NO: 716) 
(SEQ ID NO: 718) 
(SEQ ID NO:720) 
(SEQ ID NO: 722) 
(SEQ ID NO: 724) 
(SEQ ID NO: 726) 
(SEQ ID NO: 728) 
(SEQ ID NO: 730) 
(SEQ ID NO:732) 
(SEQ ID NO: 734) ; 
(SEQ ID NO: 736) , 
(SEQ ID NO:738) 
(SEQ ID NO: 740) 
(SEQ ID NO: 742) 
(SEQ ID NO: 744) 
(SEQ ID NO: 746) 
(SEQ ID NO: 748) ; 
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TCTCTGGTTA 


(SEQ 


ID NO 


:675) 


TGGTTAGACC 


(SEQ 


IE 


i NO 


:677) 


GTTAGACCAG 


(SEQ 


ID 


NO 


: 679) 


TAGACCAGAT 


(SEQ 


ID 


NO 


:681) 


GACCAGATCT 


(SEQ 


ID 


NO 


:683) 


CCAGATCTGA 


(SEQ 


ID 


NO 


:685) 


AGATCTGAGC 


(SEQ 


ID 


NO 


:687) 


ATCTGAGCCT 


(SEQ 


ID 


NO 


:689) 


CTGAGCCTGG 


(SEQ 


ID 


NO 


:691) 


GAGCCTGGGA 


(SEQ 


ID 


NO 


:693) 


GCCTGGGAGC 


(SEQ 


ID 


NO 


:695) 


CTGGGAGCTC 


(SEQ 


ID 


NO 


:697) 


GGGAGCTCTC 


(SEQ 


ID 


NO 


:699) 


GAGCTCTCTG 


(SEQ 


ID 


NO 


:701) 


GCTCTCTGGC . 


(SEQ 


ID 


NO 


:703) 


TCTCTGGCTA 


(SEQ 


ID 


NO 


:705) 


TCTGGCTAAC 


(SEQ 


ID 


NO 


:707) 


TGGCTAACTA 


(SEQ 


ID 


NO 


709) 


GCTAACTAGG 


(SEQ 


ID 


NO: 


711) 


TAACTAGGGA 


: (SEQ 


ID 


NO: 


713) 


ACTAGGGAAC 


(SEQ 


ID 


NO: 


715) , 


TAGGGAACCC 


(SEQ 


ID 


NO: 


717) , 


GGGAACCCAC 


(SEQ 


ID 


NO: 


719) , 


GAACCCACTG 


(SEQ 


ID 


NO: 


721) , 


ACCCACTGCT 


(SEQ 


ID 


NO: 


723) j 


CCACTGCTTA 


(SEQ 


ID 


NO: 


725) ; 


ACTGCTTAAG 


(SEQ 


ID 


NO: 


727) ; 


TGCTTAAGCC 


(SEQ 


ID 


NO: 


729) j 


CTTAAGCCTC 


(SEQ 


ID 


NO: 


731) ; 


TAAGCCTCAA 


(SEQ 


ID 


NO: 


733) ; 


AGCCTCAATA 


(SEQ 


ID 


NO: 


735) ; 


CCTCAATAAA 


(SEQ 


ID 


NO: 


737) ; 


TCAATAAAGC 


(SEQ 


ID 


NO: 


739) ; 


AATAAAGCTT 


(SEQ 


ID 


NO: 


741) ; 


TAAAGCTTGC 


(SEQ 


ID 


NO: 


743) ; 


AAGCTTGCCT 


(SEQ 


ID 


NO: 


745) ; 


GCTTGCCTTG 


(SEQ 


ID 


NO: 


747) ; 


TTGCCTTGAG 


(SEQ 


ID 


NO: 


749) ; 
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TGCCTTGAGT 


(SEQ 


ID 


NO 


:750) 


CCTTGAGTGC 


(SEQ 


ID 


NO 


:752) 


TTGAGTGCTT 


(SEQ 


ID 


NO 


:754) 


GAGTGCTTCA 


(SEQ 


ID 


NO 


:756) 


GTGCTTCAAG 


(SEQ 


ID 


NO 


:758) 


GCTTCAAGTA 


(SEQ 


ID 


NO 


:760) 


TTCAAGTAGT 


(SEQ 


ID 


NO 


:762) 


CAAGTAGTGT 


(SEQ 


ID 


NO 


:764) 


AGTAGTGTGT 


(SEQ 


ID 


NO 


:766) 


TAGTGTGTGC 


(SEQ 


ID 


NO. 


:768) 


GTGTGTGCCC 


(SEQ 


ID 


NO: 


:770) 


GTGTGCCCGT 


(SEQ 


ID 


NO: 


772) 


GTGCCCGTCT 


(SEQ 


ID 


NO: 


774) 


GCCCGTCTGT 


(SEQ 


ID 


NO: 


776) 


CCGTCTGTTG ■ 


(SEQ 


ID 


NO: 


778) 


GTCTGTTGTG 


(SEQ 


ID 


NO: 


780) 


CTGTTGTGTG 


(SEQ 


ID 


NO: 


782) 


GTTGTGTGAC 


(SEQ 


ID 


NO: 


784) 


TGTGTGACTC 


(SEQ 


ID 


NO: 


786) 


TGTGTGACTC 


(SEQ 


ID 


NO: 


788) 


TGTGAGTCTG 


(SEQ 


ID 


NO: 


790) 


TGACTCTGGT 


(SEQ 


ID 


NO: 


792) 


ACTCTGGTAA 


(SEQ 


ID 


NO: 


794) 


TCTGGTAACT 


(SEQ 


ID 


NO: 


796) 


TGGTAACTAG 


(SEQ 


ID 


NO: 


798) 
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GCCTTGAGTG 


(SEQ 


ID 


NO 


:751) ; 


CTTGAGTGCT 


(SEQ 


ID 


NO 


:753) ; 


TGAGTGCTTC 


(SEQ 


ID 


NO 


:755) ; 


AGTGCTTCAA 


(SEQ 


ID 


NO 


:757) ; 


TGCTTCAAGT 


(SEQ 


ID 


NO 


:759) ; 


CTTCAAGTAG 


(SEQ 


ID 


NO 


:761) ; 


TCAAGTAGTG 


(SEQ 


ID 


NO 


:763) ; 


AAGTAGTGTG 


(SEQ 


ID 


NO 


:765) ; 


GTAGTGTGTG 


(SEQ 


ID 


NO 


:767) ; 


AGTGTGTGCC 


(SEQ 


ID 


NO 


:769) ; 


TGTGTGCCCG 


(SEQ 


ID 


NO 


:771) ; 


TGTGCCCGTC 


(SEQ 


ID 


NO. 


•773); 


TGCCCGTCTG 


(SEQ 


ID 


NO: 


775) ; 


CCCGTCTGTT 


(SEQ 


ID 


NO: 


777) ; 


CGTCTGTTGT 


(SEQ 


ID 


NO: 


779) ; 


TCTGTTGTGT 


(SEQ 


ID 


NO: 


781) ; 


TGTTGTGTGA 


(SEQ 


ID 


NO: 


783); 


TTGTGTGACT 


(SEQ 


ID 


NO: 


785) ; 


GTGTGACTCT 


(SEQ 


ID 


NO: 


787) ; 


GTGTGACTCT 


(SEQ 


ID 


NO: 


789) ; 


GTGACTCTGG 


(SEQ 


ID 


NO: 


791) ; 


GACTCTGGTA 


(SEQ 


ID 


NO: 


793) ; 


CTCTGGTAAC 


(SEQ 


ID 


NO: 


795) ; 


CTGGTAACTA 


(SEQ 


ID 


NO: 


797) ; 


GGTAACTAGA 


(SEQ 


ID 


NO: 


799) . 



50. A method according to claim 24 wherein said HIV-1 isolate has the identifying 
characteristics of HIV-1 isolate ECACC V94101706 or HIV-1 isolate ECACC 
V941031169 or isolate C54 deposited at ECACC on 14 February, 1995 under Accession 
Number , 

51. A method according to claim 20 wherein the HIV-1 isolate is reactive to 
antibodies to a glycoprotein of HIV-1, is capable of inducing an immune response of 
gag, pol and/or env and which is incapable of directing synthesis of a ne/gene product 
or a full length nef gene product. 
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52. A method according to claim 5 1 wherein the glycoprotein is at least one of gp4 1 - 
45, gpl20 and/or gpl60. 

53 . A method for obtaining a preparation of non-pathogenic HIV- 1 from a biological 
sample, said method comprising co-culturing PBMCs from said biological sample from 
an individual putatively infected with said non-pathogenic HTV-1 with HIV-1 
seronegative donor PBMCs depleted for CD8+ cells, harvesting the PBMCs and 
supernatant fluid every from about 5 to about 10 days and adding fresh medium with 
CD8+ depleted PBMCs with said fresh medium and isolating said virus from the 
supernatant fluid. 

54. A method according to claim 53 wherein the seronegative PBMCs are pretreated 
with a cytokine for a time and under conditions sufficient to enhance HTV-1 replication. 

55. A method according to claim 54 wherein the cytokine is M-CSF. 

56. A method according to claim 53 or 54 or 55 wherein the HIV-1 seronegative 
donor PBMCs are first stimulated with PHA and/or IL-2. 

57. A method according to claim 53 wherein the PBMCs from a putative infected 
individual are also CD8+ depleted prior to coculturing with seronegative donor PBMCs. 

58. A method according to claim 53 or 57 wherein the cocultured cells are subjected 
to UV irradiation. 

59. A method for obtaining a preparation of non-pathogenic HTV-1 from a biological 
sample, said method comprising co-culturing monocytes from said biological sample 
from an individual putatively infected with said non-pathogenic HTV-1 with HTV-1 
seronegative donor PBMCs depleted for CD8+ cells, harvesting the monocytes and 
PBMCs and supernatant fluid every from about 5 to about 10 days and adding fresh 
medium with CD8+ depleted PBMCs with said fresh medium and isolating said vims 
from the supernatant fluid. 
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60. A method according to claim 59 wherein the seronegative PBMCs are pretreated 
with a cytokine for a time and under conditions sufficient to enhance HIV-1 replication. 

61. A method according to claim 60 wherein the cytokine is M-CSF. 

62. A method according to claim 59 or 60 or 61 wherein the HTV-1 seronegative 
donor PBMCs are first stimulated with PHA and/or IL-2. 

63. A method according to claim 59 wherein the monocytes from a putative infected 
individual are also CD8+ depleted prior to coculturing with seronegative donor PBMCs. 

64. A method according to claim 59 or 63 wherein the cocultured cells are subjected 
to UV irradiation. 

65. A method for identifying or screening for compounds capable of reducing or 
otherwise interfering with HTV-1 replication, said method comprising contacting a 
compound to be tested with a cell or cell extract containing or capable of containing a 
nef gene product fused to a reporter molecule capable of giving an identifiable signal 
and screening for a compound which inhibits said signal. 

66. A method according to claim 65 wherein the compound is an antibody to nef 
gene product or a part thereof. 

67. A method according to claim 65 wherein the compound is a DNA targeting agent 
and inhibits transcription of the nef gene. 

68. A compound capable of inhibiting nef gene product function or nef gene 
transcription. 



69. A compound capable of inhibiting nef gene product function or nef gene 
transcription identified according to the method of claim 65 or 66 or 67. 
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70. A therapeutic composition useful for inhibiting or reducing productive infection 
by a pathogenic strain of HIV-1 and/or for vaccinating an individual against the 
development of AIDS or AIDS-related diseases, said composition comprising a non- 
pathogenic strain of HIV- 1 according to any one of claims 1 to 16 and one or more 
pharmaceutical acceptable carriers and/or diluents. 

71. A therapeutic composition according to claim 70 wherein said HIV-1 isolate 
comprises genetic material that directs expression of antisense or ribozyme nucleotide 
sequences which inhibit production of one or more proteins encoded by a pathogenic 
strain of HIV-1. 



72. A viral isolate which: 

0) ■ ■ is genetically or immunologically related to a pathogenic strain of HTV- 1 ; 

(ii) is substantially non-pathogenic in human subjects; 

(iii) < comprises a first nucleotide sequence constituting its genome which is 

capable of hybridising under medium stringency conditions to SEQ ID 
NO: 1 or a complementary form thereof; and 

(iv) comprises a second nucleotide sequence within said first nucleotide 



sequence and which second nucleotide sequence directs expression of a 
mRNA molecule capable of inhibiting, reducing or otherwise down- 
regulating translation of a protein or polypeptide encoded by a pathogenic 
strain of HIV-1 or inhibit, reduce or otherwise down regulate operation 
of a non-protein encoding a region of a pathogenic strain of HIV-1. 

73. A viral isolate according to claim 72 wherein the targeted protein or polypeptide 
is selected from the list consisting of gag or poL 

74. A viral isolate according to claim 72 wherein the targeted protein or polypeptide 
is selected from the list consisting of em or tat. 



75. A viral isolate according to claim 72 wherein the targeted protein or polypeptide 
is selected from the list consisting of rev or vpu. 
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76. A viral isolate according to claim 72 wherein the targeted protein or polypeptide 
is selected from the list consisting of vpr, vif or nef. 

77. A viral isolate according to claim 72 wherein the targeted protein is nef. 

78. A method for determining the pathogenicity of an HIV-1 strain after said HIV-1 
strain infects cells of an individual, said method comprising determining the presence 
of a deletion mutation in the genome of said HIV-1 wherein said deletion mutation 
results in said genome being unable to synthesise a polypeptide or protein from a 
pathogenic strain of HIV-1 or directing the synthesis of a truncated form of said 
polypeptide or protein wherein the presence of a such a mutation is indicative of the 
presence of a non-pathogenic strain of HIV-1. 

79. The method according to claim 78 wherein said non-pathogenic HIV-1 carries 
a deletion in its genome of at least 10 nucleotides within the region from nucleotide 
8787 to nucleotide 9709 using the nucleotide numbering of HIV-1 strain NL4-3. 

80. A strain of HIV-1 according to claim 79 wherein said non-pathogenic HIV-1 
carries a deletion in its genome of at least 10 nucleotides from within a region selected 
from the list consisting of: 

nucleotide (i) 8830-8862; 

(ii) 9009-9035; 

(iii) 9019-9029; and 

(iv) 9033-9049. 

81. A strain of HIV-1 according to claim 79 wherein said non-pathogenic HIV-1 
carries a deletion in its genome of at least 10 nucleotides from within a region selected 
from the list consisting of: 

nucleotide (v) 9281-9371; 

(vi) 9281-9362; 

(vii) 9105-9224; and 

(viii) 9271-9370. 
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82. A strain of HTV-1 according to claim 79 wherein said non-pathogenic HIV-1 
carries a deletion in its genome of at least 10 nucleotides from within a region selected 
from the list consisting of: 

nucleotide (ix) 8882-8928; 

(x) 8850-9006; 

(xi) 8792-9041; and 

(xii) 9112-9204. 

83. A strain of HIV-1 according to claim 79 wherein said non-pathogenic HTV-1 
carries a deletion in its genome of at least 10 nucleotides from within a region selected 
from the list consisting of: 

nucleotide (xiii) 9105-9224; 

(xiv) 9389-9395; and 

(xv) 9281-9366. 



84. A strain 

least one of the 

ATGGGTGGCA 
GGGTGGCAAG 
GTGGCAAGTG 
GGCAAGTGGT 
CAAGTGGTCA 
AGTGGTCAAA 
TGGTCAAAAA 
GTCAAAAAGT 
CAAAAAGTAG 
AAAAGTAGTG 
AAGTAGTGTG 
GTAGTGTGAT 
AGTGTGATTG 
TGTGATTGGA 
TGATTGGATG 
ATTGGATGGC 
TGGATGGCCT 
GATGGCCTGC 



of HTV-1 according to claim 79 wherein said deletion encompasses at 
following decanucleotides from the nef gene of HIV-1 strain NL4-3: 



(SEQ ID NO:2) 
(SEQ ID NO:4) 
(SEQ ID NO: 6) 
(SEQ ID NO:8) 
(SEQ ID NO: 10) 
(SEQ ID NO:l>2) 
(SEQ ID NO: 14) 
(SEQ ID NO:16) 
(SEQ ID NO: 18) 
(SEQ ID NO: 20) 
(SEQ ID NO: 22) 
(SEQ ID NO:24) 
(SEQ ID NO:26) 
(SEQ ID NO: 28) 
(SEQ ID NO:30) 
(SEQ ID NO: 32) 
(SEQ ID NO: 34) 
(SEQ ID NO: 36) 



TGGGTGGCAA (SEQ ID NO: 3) ; 
GGTGGCAAGT (SEQ ID NO: 5); 
TGGCAAGTGG ( SEQ ID NO : 7 ) ; 
GCAAGTGGTC (SEQ ID NO: 9); 
AAGTGGTCAA (SEQ ID NO: 11) 
GTGGTCAAAA (SEQ ID NO: 13) 
GGTCAAAAAG (SEQ ID NO: 15) 
TCAAAAAGTA (SEQ ID NO: 17) 
AAAAAGTAGT (SEQ ID NO: 19); 
AAAGTAGTGT ( SEQ ID NO : 2 1 ) ; 
AGTAGTGTGA ( SEQ ID NO : 2 3 ) ; 
TAGTGTGATT (SEQ ID NO: 25) ; 
GTGTGATTGG (SEQ ID NO: 27) 
GTGATTGGAT (SEQ ID NO: 29) 
GATTGGATGG (SEQ ID NO: 31) 
TTGGATGGCC (SEQ ID NO: 33) 
GGATGGCCTG (SEQ ID NO: 35) 
ATGGCCTGCT (SEQ ID NO: 3 7) 
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TGGCCTGCTG 


(SEQ 


ID 


NO 


:38) 


/ 


GCCTGCTGTA 


(SEQ 


ID 


NO 


:40) 


/ 


CTGCTGTAAG 


(SEQ 


ID 


NO 


:42) 


/ 


GCTGTAAGGG 


(SEQ 


ID 


NO 


:44) 


/ 


TGTAAGGGAA 


(SEQ 


ID 


NO 


:46) 


i 


TAAGGGAAAG 


(SEQ 


ID 


NO 


:48) 


i 


AGGGAAAGAA 


(SEQ 


ID 


NO 


:50) 


t 


GGAAAGAATG 


(SEQ 


ID 


NO 


:52) 


9 


AAAGAATGAG 


(SEQ 


ID 


NO 


:54) 


; 


AGAATGAGAC 


(SEQ 


ID 


NO 


:56) 


/ 


AATGAGACGA 


(SEQ 


ID 


NO 


58) 


; 


TGAGACGAGC 


(SEQ 


ID 


NO: 


60) 


; 


AGACGAGCTG 


(SEQ 


ID 


NO: 


62) 




ACGAGCTGAG 


(SEQ 


ID 


NO: 


64) 




GAGCTGAGCC 


(SEQ 


ID 


NO: 


66) 




GCTGAGCCAG 


(SEQ 


ID 


NO: 


68) 




TGAGCCAGCA 


(SEQ. 


ID 


NO: 


70) 




AGCCAGCAGC 


(SEQ 


ID 


NO: 


72) 




CCAGCAGCAG 


(SEQ 


ID 


NO: 


74) 




AGCAGCAGAT 


(SEQ 


ID 


NO: 


76) 




CAGCAGATGG 


(SEQ 


ID 


NO: 


78) 




GCAGATGGGG 


(SEQ 


ID 


NO: 


80) 




AGATGGGGTG 


(SEQ 


ID 


NO: 


82) 




ATGGGGTGGG 


(SEQ 


ID 


NO: 


84) 




GGGGTGGGAG 


(SEQ 


ID 


NO: 


86) 




GGTGGGAGCA 


(SEQ 


ID 


NO: 


88) 




TGGGAGCAGT 


(SEQ 


ID 


NO: 


90) 




GGAGCAGTAT 


(SEQ 


ID 


NO: 


92) 




AGCAGTATCT 


(SEQ 


ID 


NO: 


94} 




CAGTATCTCG 


(SEQ 


ID 


NO: 


96) 




GTATCTCGAG 


(SEQ 


ID 


NO: 


98) 




ATCTCGAGAC 


(SEQ 


ID 


NO: 


100 




CTCGAGACCT 


(SEQ 


ID 


NO: 


102 


' f 


CGAGACCTAG 


(SEQ 


ID 


NO: 


104 


' t 


AGACCTAGAA 


(SEQ 


ID 


NO: 


106 




ACCTAGAAAA 


(SEQ 


ID 


NO: 


108 


/ 


CTAGAAAAAC 


(SEQ 


ID 


NO: 


110] 


' 7 


AGAAAAACAT 


(SEQ 


ID 


NO: 


112] 


1 / 
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GGCCTGCTGT 


(SEQ 


ID 


NO 


:39 


CCTGCTGTAA 


(SEQ 


ID 


NO 


:41 


TGCTGTAAGG 


(SEQ 


ID 


NO 


:43 


CTGTAAGGGA 


(SEQ 


ID 


NO 


:45 


GTAAGGGAAA 


(SEQ 


ID 


NO 


:47 


AAGGGAAAGA 


(SEQ 


ID 


NO- 


49 


GGGAAAGAAT 


(SEQ 


ID 


NO: 


51 


GAAAGAATGA 


(SEQ 


ID 


NO: 


53 


AAGAATGAGA 


(SEQ 


ID 


NO: 


55 


GAATGAGACG 


(SEQ 


ID 


NO: 


57 


ATGAGACGAG 


(SEQ 


ID 


NO: 


59, 


GAGACGAGCT 


(SEQ 


ID 


NO: 


61] 


GACGAGCTGA 


(SEQ 


ID 


NO: 


63] 


CGAGCTGAGC 


(SEQ 


ID 


NO: 


65] 


AGCTGAGCCA 


(SEQ 


ID 


NO: 


67] 


CTGAGCCAGC 


(SEQ 


ID 


NO: 


69} 


GAGCCAGCAG 


(SEQ 


ID 


NO: 


71) 


GCCAGCAGCA 


(SEQ 


ID 


NO: 


73) 


CAGCAGCAGA 


(SEQ 


ID 


NO: 


75) 


GCAGCAGATG 


(SEQ 


ID 


NO: 


77) 


AGCAGATGGG 


(SEQ 


ID 


NO: 


79) 


CAGATGGGGT 


(SEQ 


ID 


NO: 


81) 


GATGGGGTGG 


(SEQ 


ID 


NO: 


83) 


TGGGGTGGGA 


(SEQ 


ID 


NO: 


85) 


GGGTGGGAGC 


(SEQ 


ID 


NO: 


87) 


GTGGGAGCAG 


(SEQ 


ID 


NO: 


89) 


GGGAGCAGTA 


(SEQ 


ID 


NO: 


91) 


GAGCAGTATC 


(SEQ 


ID 


NO: 


93) 


GCAGTATCTC 


(SEQ 


ID 


NO: 


95) 


AGTATCTCGA 


(SEQ 


ID 


NO: 


97) 


TATCTCGAGA 


(SEQ 


ID 


NO: 


99) 


TCTCGAGACC 


(SEQ 


ID 


NO: 101) 



TCGAGACCTA (SEQ ID NO: 103) 

GAGACCTAGA (SEQ ID NO: 105) 

GACCTAGAAA (SEQ ID NO: 107) 

CCTAGAAAAA (SEQ ID NO: 109) 

TAGAAAAACA (SEQ ID NO: 111) 

GAAAAACATG (SEQ ID NO: 113) 
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AAAAACATGG (SEQ ID NO: 114) 
AAACATGGAG (SEQ ID NO: 116) 
ACATGGAGCA (SEQ ID NO: 118) 
ATGGAGCAAT (SEQ ID NO: 120) 
GGAGCAATCA (SEQ ID NO: 122) 
AGCAATCACA (SEQ ID NO: 124) 
CAATCACAAG (SEQ ID NO: 126) 
ATCACAAGTA (SEQ ID NO: 12 8) 
CACAAGTAGC (SEQ ID NO: 130) 
CAAGTAGCAA (SEQ ID NO: 13 2) 
AGTAGCAATA (SEQ ID NO: 134) 
TAGCAATACA (SEQ ID NO: 13 6) 
GCAATACAGC (SEQ ID NO: 138), 
AATACAGCAG (SEQ ID NO: 140) ; 
TACAGCAGCT (SEQ ID NO: 142) ; 
CAGCAGCTAA (SEQ ID * NO: 144) :j 
GCAGCTAACA (SEQ . ID NO: 146) ; 
AGCTAACAAT (SEQ ID NO: 148) ; 
CTAACAATGC (SEQ ID NO : 150) ; 
AACAATGCTG (SEQ ID NO: 152) ; 
CAATGCTGCT (SEQ ID NO: 154) ; 
ATGCTGCTTG (SEQ ID NO: 156) ; 
GCTGCTTGTG (SEQ ID NO: 158); 
TGCTTGTGCC (SEQ ID NO: 160) ; 
CTTGTGCCTG (SEQ ID NO: 162) ; 
TGTGCCTGGC (SEQ ID NO:l£4); 
TGCCTGGCTA (SEQ ID NO: 166); 
CCTGGCTAGA (SEQ ID NO: 168) ; 
TGGCTAGAAG (SEQ ID NO:170); 
GCTAGAAGCA (SEQ ID NO: 172); 
TAGAAGCACA (SEQ ID NO: 174); 
GAAGCACAAG (SEQ ID NO:176); 
AGCACAAGAG (SEQ ID NO: 178); 
CACAAGAGGA (SEQ ID NO:180); 
CAAGAGGAGG (SEQ ID NO: 182); 
AGAGGAGGAA (SEQ ID NO: 184); 
AGGAGGAAGA (SEQ ID NO: 186); 
GAGGAAGAGG (SEQ ID NO: 188); 



AAAACATGGA (SEQ ID NO: 115) 
AACATGGAGC (SEQ ID NO: 117) 
CATGGAGCAA (SEQ ID NO: 119) 
TGGAGCAATC (SEQ ID NO: 121) 
GAGCAATCAC (SEQ ID NO: 123) 
GCAATCACAA (SEQ ID NO: 125) 
AATCACAAGT (SEQ ID NO: 127) 
TCACAAGTAG (SEQ ID NO: 129) 
ACAAGTAGCA (SEQ ID NO: 131) 
AAGTAGCAAT (SEQ ID NO: 133) 
GTAGCAATAC (SEQ ID NO: 135) 
AGCAATACAG (SEQ ID NO:137) 
CAATACAGCA (SEQ ID NO: 139), 
ATACAGCAGC (SEQ ID NO: 141), 
ACAGCAGCTA (SEQ ID NO: 143); 
AGCAGCTAAC (SEQ ID NO:145); 
CAGCTAACAA (SEQ ID NO: 147); 
GCTAACAATG (SEQ ID NO: 149); 
TAACAATGCT (SEQ ID NO: 151); 
ACAATGCTGC (SEQ ID NO: 153); 
AATGCTGCTT (SEQ ID NO: 155); 
TGCTGCTTGT (SEQ ID NO: 157); 
CTGCTTGTGC (SEQ ID NO: 159) ; 
GCTTGTGCCT (SEQ ID NO: 161) ; 
TTGTGCCTGG (SEQ ID NO: 163) ; 
GTGCCTGGCT (SEQ ID NO: 165) ; 
GCCTGGCTAG (SEQ ID NO:167); 
CTGGCTAGAA (SEQ ID NO: 169) ; 
GGCTAGAAGC (SEQ ID NO: 171) ; 
CTAGAAGCAC (SEQ ID NO: 173) ; 
AGAAGCACAA (SEQ ID NO:175) ; 
AAGCACAAGA (SEQ ID NO:177) ; 
GCACAAGAGG (SEQ ID NO:179); 
ACAAGAGGAG (SEQ ID NO:181) ; 
AAGAGGAGGA (SEQ ID NO: 183) ; 
GAGGAGGAAG (SEQ ID NO:185); 
GGAGGAAGAG (SEQ ID NO:187); 
AGGAAGAGGT (SEQ ID NO: 189) ; 
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GGAAGAGGTG 


(SEQ 


ID 


NO 


:190 


AAGAGGTGGG 


(SEQ 


ID 


NO 


:192 


GAGGTGGGTT 


(SEQ 


ID 


NO 


:194 


GGTGGGTTTT 


(SEQ 


ID 


NO 


:19S 


TGGGTTTTCC 


(SEQ 


ID 


NO 


:198 


GGTTTTCCAG 


(SEQ 


ID 


NO 


:200 


TTTTCCAGTC 


(SEQ 


ID 


NO 


:202 


TTCCAGTCAC 


(SEQ 


ID 


NO 


:204 


CCAGTCACAC 


(SEQ 


ID 


NO 


:206 


AGTCACACCT 


(SEQ 


ID 


NO 


:208 


TCACACCTCA 


(SEQ 


ID 


NO 


:210. 


ACACCTCAGG 


(SEQ 


ID 


NO 


:212] 


ACCTCAGGTA 


(SEQ 


ID 


NO 


:214] 


CTCAGGTACC 


(SEQ 


ID 


NO. 


:216] 


CAGGTACCTT 


(SEQ 


ID 


NO: 


218] 


GGTACCTTTA 


(SEQ 


ID 


NO: 


220] 


TACCTTTAAG 


(SEQ 


ID 


NO: 


222) 


CCTTTAAGAC 


(SEQ 


ID 


NO: 


224) 


TTTAAGACCA 


(SEQ 


ID 


NO: 


226) 


TAAGACCAAT 


(SEQ 


ID 


NO: 


228) 


AGACCAATGA 


(SEQ 


ID 


NO: 


230) 


ACCAATGACT 


(SEQ 


ID 


NO: 


232) 


CAATGACTTA 


(SEQ 


ID 


NO: 


234) 


ATGACTTACA 


(SEQ 


ID 


NO: 


236) 


GACTTACAAG 


(SEQ 


ID 


NO: 


238) 


CTTACAAGGC 


(SEQ 


ID 


NO: 


240) 


TACAAGGCAG 


(SEQ 


ID 


NO: 


242) 


CAAGGCAGCT 


(SEQ 


ID 


NO: 


244) 


AGGCAGCTGT 


(SEQ 


ID 


NO: 


246) 


GCAGCTGTAG 


(SEQ 


ID 


NO: 


248) 


AGCTGTAGAT 


(SEQ 


ID 


NO: 


250) 


CTGTAGATCT 


(SEQ 


ID 


NO: 


252) 


GTAGATCTTA 


(SEQ 


ID 


NO: 


254) 


AGATCTTAGC 


(SEQ 


ID 


NO: 


256) 


ATCTTAGCCA 


(SEQ 


ID 


NO: 


258) 


CTTAGCCACT 


(SEQ 


ID 


NO: 


260) 


TAGCCACTTT 


(SEQ 


ID 


NO: 


262) 


GCCACTTTTT 


(SEQ 


ID 


NO: 


264) 
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GAAGAGGTGG 


(SEQ 


ID 


NO 


:191) ; 


AGAGGTGGGT 


(SEQ 


ID 


NO 


:193) ; 


AGGTGGGTTT 


(SEQ 


ID 


NO 


:195) ; 


GTGGGTTTTC 


(SEQ 


ID 


NO 


:197) ; 


GGGTTTTCCA 


(SEQ 


ID 


NO 


:199) ; 


GTTTTCCAGT 


(SEQ 


ID 


NO 


:201) ; 


TTTCCAGTCA 


(SEQ 


ID 


NO 


:203) ; 


TCCAGTCACA 


(SEQ 


ID 


NO 


:205) ; 


CAGTCACACC 


(SEQ 


ID 


NO 


:207) ; 


GTCACACCTC 


(SEQ 


ID 


NO 


:209) ; 


CACACCTCAG 


(SEQ 


ID 


NO 


:211) / 


CACCTCAGGT 


(SEQ 


ID 


NO 


:213) ; 


CCTCAGGTAC 


(SEQ 


ID 


NO: 


215) ; 


TCAGGTACCT 


(SEQ 


ID 


NO: 


217) ; 


AGGTACCTTT 


(SEQ 


ID 


NO: 


219) ; 


GTACCTTTAA 


(SEQ 


ID 


NO: 


221) ; 


ACCTTTAAGA 


(SEQ 


ID 


NO: 


223) ; 


CTTTAAGACC 


(SEQ 


ID 


NO: 


225) ; 


TTAAGACCAA 


(SEQ 


ID 


NO.: 


227) ; 


AAGACCAATG 


(SEQ 


ID 


NO: 


229) ; 


GACCAATGAC 


(SEQ 


ID 


NO: 


231) ; 


CCAATGACTT 


(SEQ 


ID 


NO: 


233) ; 


AATGACTTAC 


(SEQ 


ID 


NO: 


235) ; 


TGACTTACAA 


(SEQ 


ID 


NO: 


237) ; 


ACTTACAAGG 


(SEQ 


ID 


NO: 


239) ; 


TTACAAGGCA 


(SEQ 


ID 


NO: 


241) ; 


ACAAGGCAGC 


(SEQ 


ID 


NO: 


243) ; 


AAGGCAGCTG 


(SEQ 


ID 


NO: 


245) ; 


GGCAGCTGTA 


(SEQ 


ID 


NO: 


247) ; 


CAGCTGTAGA 


(SEQ 


ID 


NO: 


249) ; 


GCTGTAGATC 


(SEQ 


ID 


NO: 


251) ; 


TGTAGATCTT 


(SEQ 


ID 


NO: 


253) ; 


TAGATCTTAG 


(SEQ 


ID 


NO: 


255) ; 


GATCTTAGCC 


(SEQ 


ID 


NO: 


257) ; 


TCTTAGCCAC 


(SEQ 


ID 


NO: 


259) ; 


TTAGCCACTT 


(SEQ 


ID 


NO: 


261) ; 


AGCCACTTTT 


(SEQ 


ID 


NO: 


263) / 


CCACTTTTTA 


(SEQ 


ID 


NO: 


265) ; 
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CACTTTTTAA (SEQ ID NO: 2 66) ; 

CTTTTTAAAA (SEQ ID NO: 268) ; 

TTTTAAAAGA (SEQ ID NO:270) ; 

TTAAAAGAAA (SEQ ID NO:272); 

AAAAGAAAAG (SEQ ID NO: 274); 

AAGAAAAGGG (SEQ ID NO:276); 

GAAAAGGGGG (SEQ ID NO: 278) ; 

AAAGGGGGGA (SEQ ID NO:280); 

AGGGGGGACT (SEQ ID NO:282); 

GGGGGACTGG (SEQ ID NO: 284) ; 

GGGACTGGAA (SEQ ID NO:286) ; 

GACTGGAAGG (SEQ ID NO:288) ; 

CTGGAAGGGC (SEQ ID NO: 290) ; 

GGAAGGGCTA (SEQ ID NO: 292) ; 

AAGGGCTAAT (SEQ ID NO: 294) ; 

GGGCTAATTC (SEQ ID NO: 296) ; 

GCTAATTCAC (SEQ ID NO:298); 
TAATTCACTG (SEQ ID NO: 300); 
ATTCACTCCC (SEQ ID NO: 302) ; 
TCACTCCCAA (SEQ ID NO: 304);- 
ACTCCGAAAG (SEQ ID NO: 3 06) ; 
TCCCAAAGAA (SEQ ID NO:308) ; 
CCAAAGAAGA (SEQ ID NO:310); 
AAAGAAGACA (SEQ ID NO:312) ; 
AGAAGACAAG (SEQ ID NO: 314) ; 
AAGACAAGAT (SEQ ID NO:316) ; 
GACAAGATAT (SEQ ID NO:318) ; 
CAAGATATCC (SEQ ID NO:320) ; 
AGATATCCTT (SEQ ID NO: 322) ; 
ATATCCTTGA (SEQ ID NO: 324) ; 
ATCCTTGATC (SEQ ID NO: 326) ; 
CCTTGATCTG ( SEQ ID NO : 3 2 8 ) ; 
TTGATCTGTG (SEQ ID NO: 330) ; 
GATCTGTGGA (SEQ ID NO: 332) ; 
TCTGTGGATC (SEQ ID NO: 334) ; 
TGTGGATCTA (SEQ ID NO: 336) ; 
TGGATCTACC (SEQ ID NO: 338) ; 
GATCTACCAC ( SEQ ID NO : 3 4 0 ) ; 



PCT/AU95/00063 

232 - 

ACTTTTTAAA (SEQ ID NO: 267) ; 

TTTTTAAAAG (SEQ ID NO: 269); 

TTTAAAAGAA (SEQ ID NO: 271) ; 

TAAAAGAAAA (SEQ ID NO: 273); 

AAAGAAAAGG (SEQ ID NO:275) ; 

AGAAAAGGGG (SEQ ID NO: 277) ; 

AAAAGGGGGG (SEQ ID NO:279); 

AAGGGGGGAC ( SEQ ID NO : 2 8 1 ) ; 

GGGGGGACTG (SEQ ID NO:283); 

GGGGACTGGA (SEQ ID NO: 285) ; 

GGACTGGAAG (SEQ ID NO: 287) ; 

ACTGGAAGGG (SEQ ID NO: 289) ; 

TGGAAGGGCT (SEQ ID NO:291) ; 

GAAGGGCTAA (SEQ ID NO:293); 
AGGGCTAATT (SEQ ID NO: 295) ; 
GGCTAATTCA (SEQ ID NO:297) ; 
CTAATTCACT (SEQ ID NO:299) ; 
AATTCACTCC (SEQ ID NO: 301) ; 
TTCACTCCCA (SEQ ID NO: 3 03) ; 
CACTCCCAAA (SEQ ID NO: 3 05) 7 - 
CTCCCAAAGA (SEQ ID NO: 3 07) ; 
CCCAAAGAAG (SEQ ID NO:309) ; 
CAAAGAAGAC (SEQ ID NO:311) ; 
AAGAAGACAA (SEQ ID NO: 3 13) ; 
GAAGACAAGA (SEQ ID NO:315) ; 
AGACAAGATA (SEQ ID NO:317) ; 
ACAAGATATC (SEQ ID NO:319) ; 
AAGATATCCT (SEQ ID NO:321); 
GATATCCTTG (SEQ ID NO:323); 
TATCCTTGAT (SEQ ID NO:325); 
TCCTTGATCT (SEQ ID NO: 327); 
CTTGATCTGT (SEQ ID NO: 329); 
TGATCTGTGG (SEQ ID NO: 331); 
ATCTGTGGAT (SEQ ID NO:333); 
CTGTGGATCT (SEQ ID NO: 335) ; 
GTGGATCTAC (SEQ ID NO: 337) ; 
GGATCTACCA (SEQ ID NO:339) ; 
ATCTACCACA (SEQ ID NO: 341) ; 
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TCTACCACAC 


(SEQ 


ID 


NO 


:342) 


TACCACACAC 


(SEQ 


ID 


NO 


:344) 


CCACACACAA 


(SEQ 


ID 


NO 


:346) 


ACACACAAGG 


(SEQ 


ID 


NO 


:348) 


ACACAAGGCT 


(SEQ 


ID 


NO 


:350) 


ACAAGGCTAC 


(SEQ 


ID 


NO 


:352) 


AAGGCTACTT 


(SEQ 


ID 


NO 


:354) 


GGCTACTTCC 


(SEQ 


ID 


NO 


:356) 


CTACTTCCCT 


(SEQ 


ID 


NO 


:358) 


ACTTCCCTGA 


(SEQ 


ID 


NO 


:360) 


TTCCCTGATT 


(SEQ 


ID 


NO 


:362) 


CCCTGATTGG 


(SEQ 


ID 


NO 


:364) 


CTGATTGGCA 


(SEQ 


ID 


NO 


:366) 


GATTGGCAGA 


(SEQ 


ID 


NO 


:368) 


TTGGCAGAAC 


(SEQ 


ID 


NO 


:370) 


GGCAGAACTA 


(SEQ 


ID 


NO 


:372) 


CAGAACTACA 


(SEQ. 


ID 


NO 


:374) 


GAACTACACA 


(SEQ 


ID 


NO 


:376) 


ACTACACACC 


(SEQ 


ID 


NO 


:378) 


TACACACCAG 


(SEQ 


ID 


NO 


:380) 


CACACCAGGG 


(SEQ 


ID 


NO 


:382) 


CACCAGGGCC 


(SEQ 


ID 


NO 


:384) 


CCAGGGCCAG 


(SEQ 


ID 


NO 


:386) 


AGGGCCAGGG 


(SEQ 


ID 


NO 


:388) 


GGCCAGGGGT 


(SEQ 


ID 


NO 


:390) 


CCAGGGGTCA 


(SEQ 


ID 


NO 


:392) 


AGGGGTCAGA 


(SEQ 


ID 


NO 


:394) 


GGGTCAGATA 


(SEQ 


ID 


NO 


:396) 


GTCAGATATC 


(SEQ 


ID 


NO 


:398) 


CAGATATCCA 


(SEQ 


ID 


NO 


:400) 


GATATCCACT 


(SEQ 


ID 


NO 


:402) 


TATCCACTGA 


(SEQ 


ID 


NO 


:404) 


TCCACTGACC 


(SEQ 


ID 


NO 


:406) 


CACTGACCTT 


(SEQ 


ID 


NO 


:408) 


CTGACCTTTG 


(SEQ 


ID 


NO 


:410) 


GACCTTTGGA 


(SEQ 


ID 


NO 


:412) 


CCTTTGGATG 


(SEQ 


ID 


NO 


:414) 


TTTGGATGGT 


(SEQ 


ID 


NO 


:416) 
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CTACCACACA 


(SEQ 


ID 


NO: 


343) ; 


ACCACACACA 


(SEQ 


ID 


NO: 


345) ; 


CACACACAAG 


(SEQ 


ID 


NO: 


347) ; 


CACACAAGGC 


(SEQ 


ID 


NO: 


349) ; 


CACAAGGCTA 


(SEQ 


ID 


NO: 


351) ; 


CAAGGCTACT 


(SEQ 


ID 


NO: 


353) ; 


AGGCTACTTC 


(SEQ 


ID 


NO: 


355) ; 


GCTACTTCCC 


(SEQ 


ID 


NO: 


357) ; 


TACTTCCCTG 


(SEQ 


ID 


NO: 


359) ; 


CTTCCCTGAT 


(SEQ 


ID 


NO: 


361) ; 


TCCCTGATTG 


(SEQ 


ID 


NO: 


363) ; 


CCTGATTGGC 


(SEQ 


ID 


NO: 


365) ; 


TGATTGGCAG 


(SEQ 


ID 


NO: 


367) ; 


ATTGGCAGAA 


(SEQ 


ID 


NO: 


369) ; 


TGGCAGAACT 


(SEQ 


ID 


NO: 


371) ; 


GCAGAACTAC 


(SEQ 


ID 


NO: 


373) ; 


AGAACTACAC 


(SEQ 


ID 


NO: 


375) ; 


AACTACACAC 


(SEQ 


ID 


NO: 


377) ; 


CTACACACCA 


(SEQ 


ID 


NO: 


379) ; 


ACACACCAGG 


(SEQ 


ID 


NO: 


381) ; 


ACACCAGGGC 


(SEQ 


ID 


NO: 


383) ; 


ACCAGGGCCA 


(SEQ 


ID 


NO: 


385) ; 


CAGGGCCAGG 


(SEQ 


ID 


NO: 


387) ; 


GGGCCAGGGG 


(SEQ 


ID 


NO: 


389) ; 


GCCAGGGGTC 


(SEQ 


ID 


NO: 


391) ; 


CAGGGGTCAG 


(SEQ 


ID 


NO: 


393) ; 


GGGGTCAGAT 


(SEQ 


ID 


NO: 


395) ; 


GGTCAGATAT 


(SEQ 


ID 


NO: 


397) ; 


TCAGATATCC 


(SEQ 


ID 


NO: 


399) ; 


AGATATCCAC 


(SEQ 


ID 


NO: 


401) ; 


ATATCCACTG 


(SEQ 


ID 


NO: 


403) ; 


ATCCACTGAC 


(SEQ 


ID 


NO: 


405) ; 


CCACTGACCT 


(SEQ 


ID 


NO: 


407) ; 


ACTGACCTTT 


(SEQ 


ID 


NO: 


409) ; 


TGACCTTTGG 


(SEQ 


ID 


NO: 


411) ; 


ACCTTTGGAT 


(SEQ 


ID 


NO: 


413) ; 


CTTTGGATGG 


(SEQ 


ID 


NO: 


415) ; 


TTGGATGGTG 


(SEQ 


ID 


NO: 


417) ; 
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TGGATGGTGC (SEQ ID NO: 418) 
GATGGTGCTA (SEQ ID NO: 420) 
TGGTGCTACA (SEQ ID NO: 422) 
GTGCTACAAG (SEQ ID NO: 424) 
GCTACAAGCT (SEQ ID NO: 426) 
TACAAGCTAG (SEQ ID' NO: 428) 
CAAGCTAGTA (SEQ ID NO: 430) 
AGCTAGTACC (SEQ ID NO: 432) 
CTAGTACCAG (SEQ ID NO: 434) 
AGTACCAGTT (SEQ ID NO: 436) 
TACCAGTTGA (SEQ ID NO:438) 
CCAGTTGAGC (SEQ ID NO:440) 
AGTTGAGCCA (SEQ ID NO: 442) 
TTGAGCCAGA (SEQ ID NO: 444) 
GAGCCAGATA (SEQ ID NO: 44 6) 
GCCAGATAAG (SEQ ID NO: 44 8) 
CAGATAAGGT (SEQ ID NO:450) 
GATAAGGTAG (SEQ ID NO: 452) 
TAAGGTAGAA (SEQ ID NO: 454) , 
AGGTAGAAGA (SEQ ID NO: 456) 
GTAGAAGAGG (SEQ ID 110:458), 
AGAAGAGGCC (SEQ ID NO:460), 
AAGAGGCCAA (SEQ ID N0:462) ; 
GAGGCCAATA (SEQ ID NO:464) ; 
GGCCAATAAA (SEQ ID NO:466) ; 
CCAATAAAGG (SEQ ID NO:46€) ; 

AATAAAGGAG (SEQ ID NO:470) ; 

TAAAGGAGAG (SEQ ID NO: 472) ; 

AAGGAGAGAA (SEQ ID NO: 4 74) ; 

GGAGAGAACA (SEQ ID NO: 476) ; 

AGAGAACACC (SEQ ID NO: 478) ; 

AGAACACCAG (SEQ ID NO:480); 

AACACCAGCT (SEQ ID NO:482) ; 

CACCAGCTTG (SEQ ID NO: 4 84) ; 

CCAGCTTGTT (SEQ ID NO: 486) ; 

AGCTTGTTAC ( SEQ ID NO : 4 8 8 ) ; 

CTTGTTACAC ( SEQ ID NO : 4 9 0 ) ; 

TGTTACACCC ( SEQ ID NO : 4 9 2 ) ; 



GGATGGTGCT (SEQ ID NO: 419) 
ATGGTGCTAC (SEQ ID NO: 421) 
GGTGCTACAA (SEQ ID NO: 423) 
TGCTACAAGC (SEQ ID N0:425) 
CTACAAGCTA (SEQ ID NO: 42 7) 
ACAAGCTAGT (SEQ ID NO: 42 9) 
AAGCTAGTAC (SEQ ID NO: 431) 
GCTAGTACCA (SEQ ID NO: 4 33) 
TAGTACCAGT (SEQ ID NO: 435) 
GTACCAGTTG (SEQ ID NO: 43 7) 
ACCAGTTGAG (SEQ ID NO: 439) 
CAGTTGAGCC (SEQ ID N0:441) 
GTTGAGCCAG (SEQ ID NO: 443) 
TGAGCCAGAT (SEQ ID NO: 44 5) 
AGCCAGATAA (SEQ ID NO: 447) 
CCAGATAAGG (SEQ ID NO:449) 
AGATAAGGTA (SEQ ID NO: 451) 
ATAAGGTAGA. (SEQ ID NO: 453) 
AAGGTAGAAG (SEQ ID NO: 455) 
GGTAGAAGAG (SEQ ID NO:457) ( 
TAGAAGAGGC (SEQ ID N0:459) , 
GAAGAGGCCA (SEQ ID N0:461) , 

AGAGGCCAAT (SEQ ID NO: 463) ; 

AGGCCAATAA (SEQ ID N0:465) ; 
GCCAATAAAG (SEQ ID NO: 4 67) ; 
CAATAAAGGA (SEQ ID NO:469) ; 

ATAAAGGAGA (SEQ ID NO:471) ; 

AAAGGAGAGA (SEQ ID NO:473) ; 

AGGAGAGAAC (SEQ ID NO: 475) ; 

GAGAGAACAC (SEQ ID N0:477) ; 

GAGAACACCA (SEQ ID NO: 479) ; 

GAACACCAGC (SEQ ID NO:481); 

ACACCAGCTT ( SEQ ID NO : 4 8 3 ) ; 

ACCAGCTTGT (SEQ ID NO: 4 85) ; 

CAGCTTGTTA ( SEQ ID NO : 4 8 7 ) ; 

GCTTGTTACA ( SEQ ID NO : 4 8 9 ) ; 

TTGTTACACC (SEQ ID NO: 4 91); 

GTTACACCCT (SEQ ID NO: 4 93); 
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TTACACCCTG (SEQ ID NO: 4 94) ; 

ACACCCTGTG (SEQ ID NO: 496) ; 

ACCCTGTGAG (SEQ ID NO: 4 98); 

CCTGTGAGCC ( SEQ ID NO : 5 0 0 ) ; 

TGTGAGCCTG (SEQ ID NO: 502) ; 

TGAGCCTGCA (SEQ ID NO: 5 04) ; 

AGCCTGCATG (SEQ ID NO: 506); 

CCTGCATGGA (SEQ ID NO:508); 

TGCATGGAAT (SEQ ID NO:510); 

CATGGAATGG (SEQ ID NO: 512) ; 

TGGAATGGAT (SEQ ID NO: 514); 

GAATGGATGA (SEQ ID NO:516) ; 
ATGGATGACC (SEQ ID NO: 518) ; 
GGATGACCCT (SEQ ID NO: 520) ; 
ATGACCCTGA (SEQ ID NO : 522); 
GACCCTGAGA (SEQ ID NO: 524) ; 
CCCTGAGAGA (SEQ ID NO:526) ; 
- CTGAGAGAGA (SEQ ID NO:528); 
GAGAGAGAAG (SEQ ID NO:530); 
GAGAGAAGTG (SEQ ID NO: 532) ; 
: GAGAAGTGTT (SEQ ID NO: 534) ; 
GAAGTGTTAG (SEQ ID NO: 536) ; 
AGTGTTAGAG (SEQ ID NO:538) ; 
TGTTAGAGTG (SEQ ID NO: 540) ; 
TTAGAGTGGA (SEQ ID NO: 542) ; 
AGAGTGGAGG (SEQ ID NO: 544) ; 
AGTGGAGGTT (SEQ ID NO: 546); 
TGGAGGTTTG (SEQ ID NO: 548) ; 
GAGGTTTGAC (SEQ ID NO: 550) ; 
GGTTTGACAG (SEQ ID NO: 552) ; 
TTTGACAGCC (SEQ ID NO: 554) ; 
TGACAGCCGC (SEQ ID NO:556); 
ACAGCCGCCT (SEQ ID NO:558); 
AGCCGCCTAG (SEQ ID NO:560); 
CCGCCTAGCA (SEQ ID NO: 562) ; 
GCCTAGCATT (SEQ ID NO: 564) ; 
CTAGCATTTC (SEQ ID NO: 566) ; 
AGCATTTCAT (SEQ ID NO:568); 



TACACCCTGT (SEQ ID NO: 4 95) 
CACCCTGTGA (SEQ ID NO: 497) 
CCCTGTGAGC (SEQ ID NO: 4 99) 
CTGTGAGCCT (SEQ ID NO: 501) 
GTGAGCCTGC (SEQ ID NO: 503) 
GAGCCTGCAT (SEQ ID NO: 505) 
GCCTGCATGG (SEQ ID NO: 507) 
CTGCATGGAA (SEQ ID NO: 509) 
GCATGGAATG (SEQ ID NO: 511) 
ATGGAATGGA (SEQ ID NO: 513) 
GGAATGGATG (SEQ ID NO: 515) 
AATGGATGAC (SEQ ID NO: 517) 
TGGATGACCC (SEQ ID NO: 519) 
GATGACCCTG (SEQ ID NO: 521) 
TGACCCTGAG (SEQ ID NO: 523) 

ACCCTGAGAG (SEQ ID NO: 525) 
CCTGAGAGAG (SEQ ID NO: 52 7), 

TGAGAGAGAA (SEQ ID NO:529) , 

AGAGAGAAGT (SEQ ID NO: 531) ; 

AGAGAAGTGT (SEQ ID NO: 533) ; 

AGAAGTGTTA (SEQ ID NO: 535); 

AAGTGTTAGA (SEQ ID NO: 537) ; 

GTGTTAGAGT (SEQ ID NO: 539) ; 

GTTAGAGTGG (SEQ ID NO: 541) ; 

TAGAGTGGAG (SEQ ID NO: 543) ; 

GAGTGGAGGT (SEQ ID NO: 545); 

GTGGAGGTTT (SEQ ID NO: 547); 

GGAGGTTTGA (SEQ ID NO: 54 9) ; 

AGGTTTGACA (SEQ ID NO: 551) ; 

GTTTGACAGC (SEQ ID NO:553); 

TTGACAGCCG (SEQ ID NO: 555) ; 

GACAGCCGCC (SEQ ID NO: 557) ; 

CAGCCGCCTA (SEQ ID NO:559); 

GCCGCCTAGC (SEQ ID NO : 5 6 1 ) ; 

CGCCTAGCAT (SEQ ID NO: 563) ; 

CCTAGCATTT (SEQ ID NO: 565) ; 

TAGCATTTCA (SEQ ID NO:567) ; 

GCATTTCATC (SEQ ID NO: 569) ; 
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CATTTCATCA 


(SEQ 


ID 


NO: 


570) ; 


ATTTCATCAC 


(SEQ 


ID 


NO: 571) ; 


TTTCATCACG 


(SEQ 


ID 


NO: 


572) ; 


TTCATCACGT 


(SEQ 


ID 


NO: 573) ; 


TCATCACGTG 


(SEQ 


ID 


NO: 


574) ; 


CATCACGTGG 


(SEQ 


ID 


NO: 575) ; 


ATCACGTGGC 


(SEQ 


ID 


NO: 


576) ; 


TCACGTGGCC 


(SEQ 


ID 


NO: 577) ; 


CACGTGGCCC 


(SEQ 


ID 


NO: 


578) ; 


ACGTGGCCCG 


(SEQ 


ID 


NO: 579) ; 


CGTGGCCCGA 


(SEQ 


ID 


NO: 


580) ; 


GTGGCCCGAG 


(SEQ 


ID 


NO: 581) ; 


TGGCCCGAGA 


(SEQ 


ID 


NO: 


582) ; 


GGCCCGAGAG 


(SEQ 


ID 


NO: 583) ; 


GCCCGAGAGC 


(SEQ 


ID 


NO: 


584) ; 


CCCGAGAGCT 


(SEQ 


ID 


NO: 585) ; 


CCGAGAGCTG 


(SEQ 


ID 


NO: 


586) ; 


CGAGAGCTGC 


(SEQ 


ID 


NO: 587) ; 


GAGAGCTGCA 


(SEQ 


ID 


NO: 


588) ; 


AGAGCTGCAT 


(SEQ 


ID 


NO: 589) ; 


GAGCTGCATC 


(SEQ 


ID 


NO: 


590) ; 


AGCTGCATCC 


(SEQ 


ID 


NO: 591) ; 


GCTGCATCCG 


(SEQ 


ID 


NO: 


592) ; 


CTGCATCCGG 


(SEQ 


ID 


NO: 593) ; 


TGCATCCGGA 


(SEQ 


ID 


NO: 


594) ; 


GCATCCGGAG 


(SEQ 


ID 


NO: 595) ; 


CATCCGGAGT 


(SEQ 


ID 


NO: 


596) ; 


ATCCGGAGTA 


(SEQ 


ID 


NO: 597) ; 


TCCGGAGTAC 


(SEQ 


ID 


NO:598) ; 


CCGGAGTACT 


(SEQ 


ID 


NO: 599) ; 


CGGAGTACTT 


(SEQ 


ID 


NO: 


600) ; 


GGAGTACTTC 


(SEQ 


ID 


NO: 601) ; 


GAGTACTTCA 


(SEQ 


ID 


NO: 


602) ; 


AGTACTTCAA 


(SEQ 


ID 


NO: 603) ; 


GTACTTCAAG 


(SEQ 


ID 


NO: 


604) ; 


TACTTCAAGA 


(SEQ 


ID 


NO: 605) ; 


ACTTCAAGAA 


(SEQ 


ID 


NO: 


606) ; 


CTTCAAGAAC 


(SEQ 


ID 


NO: 607) ; 


TTCAAGAACT 


(SEQ 


ID 


NO: 


608) ; 


TCAAGAACTG 


(SEQ 


ID 


NO: 609) ; 


CAAGAACTGC 


(SEQ 


ID. 


NO: 


610) ; 


AAGAACTGCT 


(SEQ 


ID 


NO: 611) ; 


AGAACTGCTG 


(SEQ 


ID 


NO: 


612) ; 


GAACTGCTGA 


(SEQ 


ID 


NO: 613) . 



85. A strain of HTV-1 according to claim 79 wherein said deletion encompasses at 
least one of the following decanucleotides from the LTR region of HTV-1 strain NL4-3: 



GCTTTTTGCC 


(SEQ 


ID NO: 652) 


. CTTTTTGCCT 


(SEQ 


ID 


NO: 


653) 


TTTTTGCCTG 


(SEQ 


ID NO: 654) 


. TTTTGCCTGT 


(SEQ 


ID 


NO: 


655) 


TTTGCCTGTA 


(SEQ 


ID NO:656) 


• TTGCCTGTAC 


(SEQ 


ID 


NO: 


657) 


TGCCTGTACT 


(SEQ 


ID NO: 658) 


• GCCTGTACTG 


(SEQ 


ID 


NO: 


659) 


CCTGTACTGG 


(SEQ 


ID NO:660) , 


• CTGTACTGGG 


(SEQ 


ID 


NO: 


661) 


TGTACTGGGT 


(SEQ 


ID NO: 662) , 


r GTACTGGGTC 


(SEQ 


ID 


NO: 


663) 


TACTGGGTCT 


(SEQ 


ID NO: 664) , 


ACTGGGTCTC 


(SEQ 


ID 


NO: 


665) 


CTGGGTCTCT 


(SEQ 


ID NO:666) , 


■ TGGGTCTCTC 


(SEQ 


ID 


NO: 


667) 


GGGTCTCTCT 


(SEQ 


ID NO: 668) , 


• GGTCTCTCTG 


(SEQ 


ID 


NO: 


669) 


GTCTCTCTGG 


(SEQ 


ID NO: 670) , 


TCTCTCTGGT 


(SEQ 


ID 


NO: 


671) 


CTCTCTGGTT 


(SEQ 


ID NO: 672) ( 


TCTCTGGTTA 


(SEQ 


ID 


NO: 


673) 


CTCTGGTTAG 


(SEQ 


ID NO:674) , 


TCTCTGGTTA 


(SEQ 


ID 


NO: 


675) 
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CTGGTTAGAC 


(SEQ 


ID 


NO 


:676) 


GGTTAGACCA 


(SEQ 


ID 


NO 


:678) 


TTAGACCAGA 


(SEQ 


ID 


NO 


-.680) 


AGACCAGATC 


(SEQ 


ID 


NO 


:682) 


ACCAGATCTG 


(SEQ 


ID 


NO 


:684) 


CAGATCTGAG 


(SEQ 


ID 


NO 


:686) 


GATCTGAGCC 


(SEQ 


ID 


NO 


:688) 


TCTGAGCCTG 


(SEQ 


ID 


NO 


:690) 


TGAGCCTGGG 


(SEQ 


ID 


NO 


:692) 


AGCCTGGGAG 


(SEQ 


ID 


NO 


:694) 


CCTGGGAGCT 


(SEQ 


ID 


NO 


:696) 


TGGGAGCTCT 


(SEQ 


ID 


NO 


:698) 


GGAGCTCTCT 


(SEQ 


ID 


NO 


:700) 


AGCTCTCTGG 


(SEQ 


ID 


NO 


:702) 


CTCTCTGGCT 


(SEQ 


ID 


NO 


:704) 


CTCTGGCTAA 


(SEQ 


ID 


NO: 


:706) 


CTGGCTAACT 


(SEQ 


ID 


NO: 


:708) 


GGCTAACTAG 


(SEQ 


ID 


NO: 


•710) 


CTAACTAGGG 


(SEQ 


ID 


NO: 


712) 


AACTAGGGAA 


(SEQ 


ID 


NO: 


714) 


CTAGGGAACC 


(SEQ 


ID 


NO: 


716) 


AGGGAACCCA 


(SEQ 


ID 


NO: 


718) 


GGAACCCACT 


(SEQ 


ID 


NO: 


720) 


AACCCACTGC 


(SEQ 


ID 


NO: 


722) 


CCCACTGCTT 


(SEQ 


ID 


NO: 


724) 


CACTGCTTAA 


(SEQ 


ID 


NO: 


726) 


CTGCTTAAGC 


(SEQ 


ID 


NO: 


728) 


GCTTAAGCCT 


(SEQ 


ID 


NO: 


730) 


TTAAGCCTCA 


(SEQ 


ID 


NO: 


732) 


AAGCCTCAAT 


(SEQ 


ID 


NO: 


734) 


GCCTCAATAA 


(SEQ 


ID 


NO: 


736) 


CTCAATAAAG 


(SEQ 


ID 


NO: 


738) 


CAATAAAGCT 


(SEQ 


ID 


NO: 


740) 


ATAAAGCTTG 


(SEQ 


ID 


NO: 


742) 


AAAGCTTGCC 


(SEQ 


ID 


NO: 


744) 


AGCTTGCCTT 


(SEQ 


ID 


NO: 


746) , 


CTTGCCTTGA 


(SEQ 


ID 


NO: 


748) , 


TGCCTTGAGT 


(SEQ 


ID 


NO: 


750) , 
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TGGTTAGACC 


(SEQ 


ID 


NO 


:677) 


GTTAGACCAG 


(SEQ 


ID 


NO 


:679) 


TAGACCAGAT 


(SEQ 


ID 


NO 


:681) 


GACCAGATCT 


(SEQ 


ID 


NO 


:683) 


CCAGATCTGA 


(SEQ 


ID 


NO 


:685) 


AGATCTGAGC 


(SEQ 


ID 


NO 


:687) 


ATCTGAGCCT 


(SEQ 


ID 


NO 


:689) 


CTGAGCCTGG 


(SEQ 


ID 


NO 


:691) 


GAGCCTGGGA 


(SEQ 


ID 


NO 


:693) 


GCCTGGGAGC 


(SEQ 


ID 


NO 


:695) 


CTGGGAGCTC 


(SEQ 


ID 


NO 


:697) 


GGGAGCTCTC 


(SEQ 


ID 


NO 


:699) 


GAGCTCTCTG 


(SEQ 


ID 


NO 


:701) 


GCTCTCTGGC 


(SEQ 


ID 


NO: 


:703) 


TCTCTGGCTA 


(SEQ 


ID 


NO: 


:705) 


TCTGGCTAAC 


(SEQ 


ID 


NO: 


707) 


TGGCTAACTA 


(SEQ 


ID 


NO: 


709) 


GCTAACTAGG 


(SEQ 


ID 


NO: 


711) 


TAACTAGGGA 


(SEQ 


ID 


NO: 


713) 


ACTAGGGAAC 


(SEQ 


ID 


NO: 


715) 


TAGGGAACCC 


(SEQ 


ID 


NO: 


717) 


GGGAACCCAC 


(SEQ 


ID 


NO: 


719) 


GAACCCACTG 


(SEQ 


ID 


NO: 


721) 


ACCCACTGCT 


(SEQ 


ID 


NO: 


723) 


CCACTGCTTA 


(SEQ 


ID 


NO: 


725) 


ACTGCTTAAG 


(SEQ 


ID 


NO: 


727) 


TGCTTAAGCC 


(SEQ 


ID 


NO: 


729) 


CTTAAGCCTC 


(SEQ 


ID 


NO: 


731) 


TAAGCCTCAA 


(SEQ 


ID 


NO: 


733) 


AGCCTCAATA 


(SEQ 


ID 


NO: 


735) 


CCTCAATAAA 


(SEQ 


ID 


NO: 


737) 


TCAATAAAGC 


(SEQ 


ID 


NO: 


739) 


AATAAAGCTT 


(SEQ 


ID 


NO: 


741) 


TAAAGCTTGC 


(SEQ 


ID 


NO: 


743) 


AAGCTTGCCT 


(SEQ 


ID 


NO: 


745) 


GCTTGCCTTG 


(SEQ 


ID 


NO: 


747) 


TTGCCTTGAG 


(SEQ 


ID 


NO: 


749) 


GCCTTGAGTG 


(SEQ 


ID 


NO: 


751) , 
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CCTTGAGTGC 


(SEQ 


ID NO : 752 ) ; 


CTTGAGTGCT 


(SEQ 


ID 


NO: 753) 


TTGAGTGCTT 


(SEQ 


ID NO: 754) ; 


TGAGTGCTTC 


(SEQ 


ID 


NO: 755) 


GAGTGCTTCA 


(SEQ 


ID NO: 756) ; 


AGTGCTTCAA 


(SEQ 


ID 


NO: 757) 


GTGCTTCAAG 


(SEQ 


ID NO: 758) ; 


TGCTTCAAGT 


(SEQ 


ID NO: 759) 


GCTTCAAGTA 


(SEQ 


ID NO: 760) ; 


CTTCAAGTAG 


(SEQ 


ID 


NO: 761) 


TTCAAGTAGT 


(SEQ ID NO: 762) ; 


TCAAGTAGTG 


(SEQ 


ID 


NO: 763) 


CAAGTAGTGT 


(SEQ 


ID NO: 764) ; 


AAGTAGTGTG 


(SEQ 


ID 


NO: 765) 


AGTAGTGTGT 


(SEQ 


ID NO: 766) ; 


GTAGTGTGTG 


(SEQ 


ID 


NO: 767) 


TAGTGTGTGC 


(SEQ 


ID NO: 768) ; 


AGTGTGTGCC 


(SEQ 


ID 


NO: 769) 


GTGTGTGCCC 


(SEQ 


ID NO: 770) ; 


TGTGTGCCCG 


(SEQ 


ID 


NO: 771) 


GTGTGCCCGT 


(SEQ 


ID NO: 772) ; 


TGTGCCCGTC 


(SEQ 


ID 


NO: 773) 


GTGCCCGTCT 


(SEQ 


ID NO: 774) ; 


TGCCCGTCTG 


(SEQ 


ID 


NO: 775) 


GCCCGTCTGT 


(SEQ 


ID NO: 776) ; 


CCCGTCTGTT 


(SEQ 


ID 


NO: 777) 


CCGTCTGTTG 


(SEQ 


ID NO: 778) ; 


CGTCTGTTGT 


(SEQ 


ID 


NO: 779) , 


GTCTGTTGTG 


(SEQ 


ID NO: 780) ; 


TCTGTTGTGT 


(SEQ 


ID NO: 781) , 


CTGTTGTGTG 


(SEQ 


ID NO: 782) ; 


TGTTGTGTGA 


(SEQ 


ID 


NO: 783) , 


GTTGTGTGAC 


(SEQ 


ID NO: 784) ; 


TTGTGTGACT 


(SEQ 


ID 


NO: 785) , 


TGTGTGACTC 


(SEQ 


ID NO: 786) ; 


GTGTGACTCT 


(SEQ 


ID 


NO:787) , 


TGTGTGACTC 


(SEQ 


ID. NO: 788) ; 


GTGTGACTCT 


(SEQ 


ID 


NO: 789) ; 


TGTGACTCTG 


(SEQ 


ID NO: 790) ; 


GTGACTCTGG 


(SEQ 


ID 


NO:791) ; 


TGACTCTGGT 


(SEQ 


ID NO: 792) ; 


GACTCTGGTA 


(SEQ 


ID 


NO:793) ; 


ACTCTGGTAA 


(SEQ 


ID NO: 7 94); 


CTCTGGTAAC 


(SEQ 


ID NO: 795) ; 


TCTGGTAACT 


(SEQ 


ID NO: 796) ; 


CTGGTAACTA 


(SEQ 


ID 


NO: 797) ; 


TGGTAACTAG 


(SEQ 


ID NO: 798) ; 


GGTAACTAGA 


(SEQ 


ID 


NO:799) . 



86. A method for determining the pathogenicity of an KV-1 strain after said HIV-1 
strain infects cells of an individual, said method comprising determining the presence 
of a deletion mutation in its genome of at least 10 nucleotides within the region from 
nucleotide 8787 to nucleotide 9709 using the nucleotide numbering of HIV-1 strain 
NL4-3 wherein the presence of said deletion mutation is indicative of the presence of 
a non-pathogenic strain of HIV-1. 

87. A method according to claim 86 wherein said non-pathogenic HIV-1 carries a 
deletion in its genome of at least 10 nucleotides from within a region selected from the 
list consisting of: 
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nucleotide (i) 8830-8862; 

(ii) 9009-9035; 

(iii) 9019-9029; and 

(iv) 9033-9049. 



88. A method according to claim 86 wherein said non-pathogenic HTV-1 carries a 
deletion in its genome of at least 10 nucleotides from within a region selected from the 
list consisting of: 

nucleotide (v) 9281-9371; 

(vi) 9281-9362; 

(vii) 9105-9224; and 

(viii) 9271-9370. 

89. A method according to claim 86 wherein said non-pathogenic HIV-1 carries a 
deletion in its genome of at least 10 nucleotides from within a region selected from the 
list consisting of: 

nucleotide (ix) . 8882-8928; 

(x) 8850-9006; 

(xi) 8792-9041; and 

(xii) 9112-9204. 

90. A method according to claim 86 wherein said non-pathogenic HIV-1 carries a 
deletion in its genome of at least 10 nucleotides from within a region selected from the 
list consisting of: 

nucleotide (xiii) 9105-9224; 

(xiv) 9389-9395; and 

(xv) 9281-9366. 



91. A strain of HTV-1 according to claim 86 wherein said deletion encompasses at 
least one of the following decanucleotides from the nef gene of HTV-1 strain NL4-3: 
ATGGGTGGCA (SEQ ID NO: 2); TGGGTGGCAA (SEQ ID NO:3) ; 

GGGTGGCAAG (SEQ ID NO: 4); GGTGGCAAGT . (SEQ ID NO: 5) ; 
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GTGGCAAGTG (SEQ ID NO: 6); 
GGCAAGTGGT (SEQ ID NO:8); 
CAAGTGGTCA (SEQ ID NO: 10); 

AGTGGTCAAA (SEQ ID NO:12); 

TGGTCAAAAA (SEQ ID NO:14); 

GTCAAAAAGT (SEQ ID NO:16); 

CAAAAAGTAG (SEQ ID NO: 18) ; 

AAAAGTAGTG (SEQ ID NO: 20) ; 

AAGTAGTGTG (SEQ ID NO: 22) ; 

GTAGTGTGAT (SEQ ID NO: 24) ; 

AGTGTGATTG (SEQ ID NO: 26) ; 

TGTGATTGGA (SEQ ID NO: 28) ; 

TGATTGGATG (SEQ ID NO: 30) ; 

ATTGGATGGC (SEQ ID NO: 32) ; 

TGGATGGCCT (SEQ ID NO: 34) ; 

GATGGCCTGC (SEQ ID NO: 36) ; 

TGGCCTGCTG (SEQ ID NO: 38) ; 

GCCTGCTGTA (SEQ ID NO: 40) ; 

CTGCTGTAAG (SEQ ID NO: 42) ; 

GCTGTAAGGG (SEQ ID NO: 44) ; 

TGTAAGGGAA (SEQ ID NO: 46); 

TAAGGGAAAG (SEQ ID NO: 48) ; 
AGGGAAAGAA (SEQ ID NO: 50); 
GGAAAGAATG (SEQ ID NO: 52); 
AAAGAATGAG (SEQ ID NO: 54) ; 
AGAATGAGAC (SEQ ID NO: 56*) ; 
AATGAGACGA (SEQ ID NO: 58); 
TGAGACGAGC (SEQ ID NO: 60); 
AGACGAGCTG (SEQ ID NO: 62); 
ACGAGCTGAG (SEQ ID NO: 64); 
GAGCTGAGCC (SEQ ID NO: 66); 
GCTGAGCCAG (SEQ ID NO: 68) ; 
TGAGCCAGCA (SEQ ID NO: 70) ; 
AGCCAGCAGC (SEQ ID NO:72); 
CCAGCAGCAG (SEQ ID NO:74); 
AGCAGCAGAT (SEQ ID NO:76); 
CAGCAGATGG (SEQ ID NO:78); 
GCAGATGGGG (SEQ ID NO: 80); 



TGGCAAGTGG 


(SEQ 


ID 


i NO:7); 


GCAAGTGGTC 


(SEQ ID NO: 9) ; 


AAGTGGTCAA 


(SEQ 


ID 


NO : 11} ; 


GTGGTCAAAA 


(SEQ 


ID 


NO: 13) ; 


GGTCAAAAAG 


(SEQ 


ID 


NO:15) ; 


TCAAAAAGTA 


(SEQ 


ID 


NO: 17) ; 


AAAAAGTAGT 


(SEQ ID NO 


>:19) ; 


AAAGTAGTGT 


(SEQ 


ID 


NO: 21) ; 


AGTAGTGTGA 


(SEQ 


ID 


NO:23) ; 


TAGTGTGATT 


(SEQ ID NO 


:25); 


GTGTGATTGG 


(SEQ 


ID 


NO:27) ; 


GTGATTGGAT 


(SEQ 


ID 


NO: 29) ; 


GATTGGATGG 


(SEQ 


ID 


NO: 31) ; 


TTGGATGGCC 


(SEQ 


ID 


NO: 33) ; 


GGATGGCCTG 


(SEQ 


ID 


NO: 35) ; 


ATGGCCTGCT . 


(SEQ 


ID 


NO: 37) ' ; 


GGCCTGCTGT 


(SEQ 


ID 


NO: 39) ; 


CCTGCTGTAA 


(SEQ 


ID 


NO: 41) ; 


TGCTGTAAGG 


(SEQ 


ID 


NO:43) ; 


CTGTAAGGGA 


(SEQ 


ID 


NO: 45) ; 


GTAAGGGAAA 


(SEQ 


ID - 


NO: 47) ; 


AAGGGAAAGA 


(SEQ 


ID 


NO:49) ; 


GGGAAAGAAT 


(SEQ 


ID 


NO: 51) ; 


GAAAGAATGA 


(SEQ 


ID 


NO: 53) ; 


AAGAATGAGA 


(SEQ 


ID 


NO:55) ; 


GAATGAGACG 


(SEQ 


ID 


NO: 57) ; 


ATGAGACGAG 


(SEQ 


ID 


NO: 59) ; 


GAGACGAGCT 


(SEQ 


ID 


NO: 61) ; 


GACGAGCTGA 


(SEQ 


ID 


NO: 63) ; 


CGAGCTGAGC 


(SEQ 


ID 


NO: 65) ; 


AGCTGAGCCA 


(SEQ 


ID 


NO: 67) ; 


CTGAGCCAGC 


(SEQ 


ID 


NO: 69) ; 


GAGCCAGCAG 


(SEQ 


ID 


NO: 71) ; 


GCCAGCAGCA 


(SEQ 


ID 


NO: 73) ; 


CAGCAGCAGA 


(SEQ 


ID 


NO: 75) ; 


GCAGCAGATG 


(SEQ 


ID 


NO: 77) ; 


AGCAGATGGG 


(SEQ 


ID 


NO: 79) ; 


CAGATGGGGT 


(SEQ 


ID 


NO: 81) ; 
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AGATGGGGTG 


(SEQ 


ID 


NO: 82) ; 


GATGGGGTGG 


(SEQ 


ID 


NO:83) 


ATGGGGTGGG 


(SEQ 


ID 


NO: 84) ; 


TGGGGTGGGA 


(SEQ 


ID 


NO:85) 


GGGGTGGGAG 


(SEQ 


ID 


NO: 86) ; 


GGGTGGGAGC 


(SEQ 


ID 


NO:87) 


GGTGGGAGCA 


(SEQ 


ID 


NO: 88) ; 


GTGGGAGCAG 


(SEQ 


ID 


NO:89) 


TGGGAGCAGT 


(SEQ 


ID 


NO: 90) ; 


GGGAGCAGTA 


(SEQ 


ID 


NO: 91) 


GGAGCAGTAT 


(SEQ 


ID 


NO:92) ; 


GAGCAGTATC 


(SEQ 


ID 


NO: 93) 


AGCAGTATCT 


(SEQ 


ID 


NO: 94) ; 


GCAGTATCTC 


(SEQ 


ID 


NO:95) 


CAGTATCTCG 


(SEQ 


ID 


NO: 96) ; 


AGTATCTCGA 


(SEQ 


ID 


NO: 97) 


GTATCTCGAG 


(SEQ 


ID 


NO: 98) ; 


TATCTCGAGA 


(SEQ 


ID 


NO: 99) 


ATCTCGAGAC 


(SEQ 


ID 


NO: 100) ; 


TCTCGAGACC 


(SEQ ID 


NO: 101) 


CTCGAGACCT 


(SEQ 


ID 


NO: 102) / 


TCGAGACCTA 


(SEQ 


ID 


NO 


:103) 




CGAGACCTAG 


(SEQ 


ID 


NO: 104) ; 


GAGACCTAGA 


(SEQ 


ID 


NO 


:105) 




AGACCTAGAA 


(SEQ 


ID 


NO: 106) ; 


GACCTAGAAA 


(SEQ 


ID 


NO 


:107) 




ACCTAGAAAA 


(SEQ 


ID 


NO: 108) ; 


CCTAGAAAAA 


(SEQ 


ID 


NO 


:109) , 




CTAGAAAAAC 


(SEQ 


ID 


NO: 110) ; 


TAGAAAAACA 


(SEQ 


ID 


NO 


:111) , 




AGAAAAACAT 


(SEQ 


ID 


NO: 112) ; 


GAAAAACATG 


(SEQ 


ID 


NO 


:113) ( 




AAAAACATGG 


(SEQ 


ID 


NO: 114) ; 


AAAACATGGA 


(SEQ 


ID 


NO 


:115) , 




AAACATGGAG 


(SEQ 


ID 


NO: 116) ; 


AACATGGAGC 


(SEQ 


ID 


NO: 117) , 




ACATGGAGCA 


(SEQ 


ID 


NO: 118) ; 


CATGGAGCAA 


(SEQ 


ID 


NO 


:119)"< 




ATGGAGCAAT 


(SEQ 


ID 


NO: 120) ; 


TGGAGCAATC 


(SEQ 


ID 


NO: 121) , 




GGAGCAATCA 


(SEQ 


ID 


NO: 122) ; 


GAGCAATCAC 


(SEQ 


ID 


NO 


:123) , 




AGCAATCACA 


(SEQ 


ID 


NO: 124) ; 


GCAATCACAA 


(SEQ 


ID 


NO 


:125) j 




CAATCACAAG 


(SEQ 


ID 


NO: 126) ; 


AATCACAAGT 


(SEQ 


ID 


NO 


:127) ; 




ATCACAAGTA 


(SEQ 


ID 


NO: 128) ; 


TCACAAGTAG 


(SEQ 


ID 


NO 


:129) ; 




CACAAGTAGC 


(SEQ 


ID 


NO: 130) ; 


ACAAGTAGCA 


(SEQ 


ID 


NO 


:131) j 




CAAGTAGCAA 


(SEQ 


ID NO: 132) ; 


AAGTAGCAAT 


(SEQ 


ID 


NO 


:133) j 




AGTAGCAATA 


(SEQ 


ID 


NO: 134) ; 


GTAGCAATAC 


(SEQ 


ID 


NO 


:135) ; 




TAGCAATACA 


(SEQ 


ID 


NO: 136) ; 


AGCAATACAG 


(SEQ 


ID 


NO 


:137) ; 




GCAATACAGC 


(SEQ 


ID 


NO: 138) ; 


CAATACAGCA 


(SEQ 


ID 


NO 


:139) ; 




AATACAGCAG 


(SEQ 


ID 


NO: 140) ; 


ATACAGCAGC 


(SEQ 


ID 


NO 


:141) : 




TACAGCAGCT 


(SEQ 


ID 


NO: 142) ; 


ACAGCAGCTA 


(SEQ 


ID 


NO 


:143) ; 




CAGCAGCTAA 


(SEQ 


ID 


NO: 144) ; 


AGCAGCTAAC 


(SEQ 


ID 


NO 


:145) j 




GCAGCTAACA 


(SEQ 


ID 


NO: 146) ; 


CAGCTAACAA 


(SEQ 


ID 


NO 


:147) ; 




AGCTAACAAT 


(SEQ 


ID 


NO:148) ; 


GCTAACAATG 


(SEQ 


ID 


NO:149) j 




CTAACAATGC 


(SEQ 


ID 


NO: 150) ; 


TAACAATGCT 


(SEQ 


ID 


NO 


:151) , 




AACAATGCTG 


(SEQ 


ID 


NO: 152) ; 


ACAATGCTGC 


(SEQ 


ID 


NO 


:153) , 




CAATGCTGCT 


(SEQ 


ID 


NO: 154) ; 


AATGCTGCTT 


(SEQ 


ID 


NO 


:155) , 




ATGCTGCTTG 


(SEQ 


ID 


NO: 156) ; 


TGCTGCTTGT 


(SEQ 


ID 


NO 


:157) ( 
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GCTGCTTGTG (SEQ ID NO: 158) 
TGCTTGTGCC (SEQ ID NO: 160) 
CTTGTGCCTG (SEQ ID NO: 162) 
TGTGCCTGGC (SEQ ID NO: 164) 
TGCCTGGCTA (SEQ ID NO: 166) 
CCTGGCTAGA (SEQ ID NO: 168) 
TGGCTAGAAG (SEQ ID NO: 170) 
GCTAGAAGCA (SEQ ID NO: 172) 
TAGAAGCACA (SEQ ID NO: 174) 
GAAGCACAAG (SEQ ID NO: 176) 
AGCACAAGAG (SEQ ID NO: 178) 
CACAAGAGGA (SEQ ID NO: 180) 
CAAGAGGAGG (SEQ ID NO: 182) 

AGAGGAGGAA (SEQ ID NO:184) 

AGGAGGAAGA (SEQ ID NO:186) 

GAGGAAGAGG (SEQ ID NO: 188) . 

GGAAGAGGTG (SEQ ID NO: 190), 

AAGAGGTGGG (SEQ ID NO: 192):, 

GAGGTGGGTT (SEQ ID NO: 194); 

GGTGGGTTTT (SEQ ID NO: 196) ; 

TGGGTTTTCG (SEQ ID NO: 198); 

GGTTTTCCAG (SEQ ID NO:200) ; 

TTTTCCAGTC (SEQ ID NO: 202) j 

TTCCAGTCAC (SEQ ID NO: 204) ; 

CCAGTCACAC (SEQ ID NO: 206) ; 

AGTCACACCT (SEQ ID NO:2&8) ; 

TCACACCTCA (SEQ ID NO:210) ; 

ACACCTCAGG (SEQ ID NO: 212) ; 

ACCTCAGGTA (SEQ ID NO: 2 14) ; 

CTCAGGTACC (SEQ ID NO: 216) ; 

CAGGTACCTT (SEQ ID NO:218) ; 

GGTACCTTTA (SEQ ID NO:220) ; 

TACCTTTAAG (SEQ ID NO: 222) ; 

CCTTTAAGAC (SEQ ID NO: 224) ; 

TTTAAGACCA (SEQ ID NO: 226) ; 

TAAGACCAAT (SEQ ID NO:228) ; 
AGACCAATGA (SEQ ID NO: 230); 
ACCAATGACT (SEQ ID NO:232) ; 
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CTGCTTGTGC (SEQ ID NO: 159); 

GCTTGTGCCT (SEQ ID NO: 161); 

TTGTGCCTGG (SEQ ID NO: 163); 

GTGCCTGGCT (SEQ ID NO: 165); 

GCCTGGCTAG (SEQ ID NO: 167); 

CTGGCTAGAA (SEQ ID NO: 169); 

GGCTAGAAGC (SEQ ID NO: 171) ; 

CTAGAAGCAC (SEQ ID NO: 173) ; 

AGAAGCACAA (SEQ ID NO:175) ; 

AAGCACAAGA (SEQ ID NO:177) ; 

GCACAAGAGG (SEQ ID NO: 179) ; 
ACAAGAGGAG (SEQ ID NO:181) ; 
AAGAGGAGGA (SEQ ID NO: 183) ; 
GAGGAGGAAG (SEQ ID NO:185) ; 
GGAGGAAGAG (SEQ ID NO: 187) ; 
AGGAAGAGGT (SEQ ID NO: 189) ; 
' GAAGAGGTGG (SEQ ID NO: 191) ; 
AGAGGTGGGT (SEQ ID NO: 193) ; 
AGGTGGGTTT (SEQ ID NO: 195); 
GTGGGTTTTC (SEQ ID NO: 197) ; 
GGGTTTTCCA (SEQ ID NO:199); 
GTTTTCCAGT (SEQ ID NO: 201) ; 
TTTCCAGTCA ( SEQ ID NO : 2 0 3 ) ; 
TCCAGTCACA (SEQ ID NO: 205) ; 
CAGTCACACC (SEQ ID N0:207); 
GTCACACCTC (SEQ ID NO: 209) ; 
CACACCTCAG (SEQ ID NO: 211) ; 
CACCTCAGGT (SEQ ID NO: 213) ; 
CCTCAGGTAC (SEQ ID NO: 215) ; 
TCAGGTACCT (SEQ ID NO: 217) ; 
AGGTACCTTT (SEQ ID NO: 219) ; 
GTACCTTTAA ( SEQ ID NO : 2 2 1 ) ; 
ACCTTTAAGA (SEQ ID NO: 223) ; 
CTTTAAGACC (SEQ ID NO: 225) ; 
TTAAGACCAA (SEQ ID NO:227) ; 
AAGACCAATG (SEQ ID NO: 229) ; 
GACCAATGAC (SEQ ID NO: 231) ; 
CCAATGACTT (SEQ ID NO: 233) ; 
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CAATGACTTA (SEQ ID NO: 234) ; 

ATGACTTACA (SEQ ID NO: 23 6) ; 

GACTTACAAG (SEQ ID NO: 238) ; 

CTTACAAGGC (SEQ ID NO:240) / 

TACAAGGCAG (SEQ ID NO: 242) ; 

CAAGGCAGCT (SEQ ID NO: 244) ; 

AGGCAGCTGT (SEQ ID NO: 246) ; 

GCAGCTGTAG (SEQ ID NO: 24 8) ; 

AGCTGTAGAT ( SEQ ID NO : 2 5 0 ) ; 

CTGTAGATCT (SEQ ID NO: 252) ; 

GTAGATCTTA (SEQ ID NO: 254) ; 

AGATCTTAGC (SEQ ID NO: 256) ; 

ATCTTAGCCA (SEQ ID NO: 258) ; 

CTTAGCCACT (SEQ ID NO:260) ; 

TAGCCACTTT (SEQ ID NO: 262); 

GCCACTTTTT (SEQ ID NO : 264 ) ; 

CACTTTTTAA (SEQ ID NO:266) ; 

CTTTTTAAAA (SEQ ID NO:268); 

TTTTAAAAGA (SEQ ID NO:270) ; 
TTAAAAGAAA (SEQ ID NO:272) ; 
AAAAGAAAAG (SEQ ID NO:274) ; 
AAGAAAAGGG (SEQ ID NO:276) ; 
GAAAAGGGGG (SEQ ID NO:278) ; 
AAAGGGGGGA (SEQ ID NO:280) ; 
AGGGGGGACT (SEQ ID NO:282) ; 
GGGGGACTGG (SEQ ID NO: 284) ; 
GGGACTGGAA (SEQ ID NO: 286) ; 
GACTGGAAGG (SEQ ID NO:288) ; 
CTGGAAGGGC (SEQ ID NO:290); 
GGAAGGGCTA (SEQ ID NO: 2 92) ; 
AAGGGCTAAT (SEQ ID NO: 2 94) ; 
GGGCTAATTC (SEQ ID NO:296) ; 
GCTAATTCAC (SEQ ID NO: 298) ; 
TAATTCACTC (SEQ ID NO: 3 00) ; 
ATTCACTCCC (SEQ ID NO: 3 02) ; 
TCACTCCCAA (SEQ ID NO:304); 
ACTCCCAAAG (SEQ ID NO:306) ; 
TCCCAAAGAA (SEQ ID NO: 308) ; 
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AATGACTTAC (SEQ ID NO: 235) ; 

TGACTTACAA (SEQ ID NO: 237) ; 

ACTTACAAGG (SEQ ID NO : 239).; 

TTACAAGGCA (SEQ ID NO:241) ; 

ACAAGGCAGC (SEQ ID NO: 243) ; 

AAGGCAGCTG (SEQ ID NO: 245) ; 

GGCAGCTGTA (SEQ ID NO:247); 

CAGCTGTAGA (SEQ ID NO:249); 

GCTGTAGATC (SEQ ID NO:251); 

TGTAGATCTT (SEQ ID NO:253); 

TAGATCTTAG (SEQ ID NO: 255) ; 

GATCTTAGCC (SEQ ID NO:257) ; 

TCTTAGCCAC (SEQ ID NO: 259) ; 

TTAGCCACTT (SEQ ID NO: 261) ; 

AGCCACTTTT (SEQ ID NO: 263) ; 

CCACTTTTTA ( SEQ : ID NO : 2 6 5 ) ; 

ACTTTTTAAA : ( SEQ ID NO : 2 6 7 ) ; 

TTTTTAAAAG (SEQ ID NO:269) ; 

TTTAAAAGAA (SEQ ID NO:271) ; 

TAAAAGAAAA (SEQ ID NO:273) ; 
AAAGAAAAGG ( SEQ ID NO : 2 75 ) ; 
AGAAAAGGGG (SEQ ID NO:277) ; 
AAAAGGGGGG (SEQ ID NO:279) ; 
AAGGGGGGAC (SEQ ID NO:281) ; 
GGGGGGACTG (SEQ ID NO:283); 
GGGGACTGGA (SEQ ID NO:285); 
GGACTGGAAG (SEQ ID NO:287); 
ACTGGAAGGG (SEQ ID NO:289); 
TGGAAGGGCT (SEQ ID NO:291); 
GAAGGGCTAA (SEQ ID NO: 293) ; 
AGGGCTAATT (SEQ ID NO:295); 
GGCTAATTCA (SEQ ID NO:297); 
CTAATTCACT (SEQ ID NO:299) ; 
AATTCACTCC (SEQ ID NO: 301) ; 
TTCACTCCCA (SEQ ID NO:303); 
CACTCCCAAA (SEQ ID NO: 305) ; 
CTCCCAAAGA (SEQ ID NO: 3 07) ; 
CCCAAAGAAG (SEQ ID NO:309) ; 
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CCAAAGAAGA (SEQ ID NO: 310) 
AAAGAAGACA (SEQ ID NO: 312) 
AGAAGACAAG (SEQ ID NO: 3 14) 
AAGACAAGAT (SEQ ID NO: 316) 
GACAAGATAT (SEQ ID NO: 318) 
CAAGATATCC (SEQ ID NO: 320) 
AGATATCCTT (SEQ ID NO: 322) 
ATATCCTTGA (SEQ ID NO: 324) 
ATCCTTGATC (SEQ ID NO: 326) 
CCTTGATCTG (SEQ ID NO: 328) 
TTGATCTGTG (SEQ ID NO: 330) 
GATCTGTGGA (SEQ ID NO:332) 
TCTGTGGATC (SEQ ID NO: 334) 
TGTGGATCTA (SEQ ID NO: 336), 
TGGATCTACC (SEQ ID NO: 338) ; 
GATCTACCAC (SEQ ID NO: 340); 
J'CTACCACAC (SEQ ID NO: 342) ; 
TACCACACAC (SEQ ID NO: 344); 
CCACACACAA (SEQ ID NO:346) ; 
ACACACAAGG (SEQ ID NO: 348) j 
ACACAAGGCT (SEQ ID NO: 350) ; 
ACAAGGCTAC (SEQ ID NO: 352) ; 
AAGGCTACTT (SEQ ID NO: 354) ; 
GGCTACTTCC (SEQ ID NO:356) ; 
CTACTTCCCT ( SEQ ID NO : 3 5 8 ) ; 

ACTTCCCTGA (SEQ ID NO: 3 60) ; 

TTCCCTGATT (SEQ ID NO: 3 62); 

CCCTGATTGG (SEQ ID NO: 3 64) ; 

CTGATTGGCA (SEQ ID NO : 3 66 ) ; 

GATTGGCAGA (SEQ ID NO:368); 

TTGGCAGAAC (SEQ ID NO: 3 70) ; 

GGCAGAACTA (SEQ ID NO:372); 

CAGAACTACA (SEQ ID NO: 374); 

GAACTACACA (SEQ ID NO:376) ; 

ACTACACACC (SEQ ID NO: 3 78) ; 

TACACACCAG (SEQ ID NO: 380) ; 

CACACCAGGG (SEQ ID NO:382) ; 

CACCAGGGCC (SEQ ID NO:384) ; 



- 244 - 

CAAAGAAGAC (SEQ ID NO: 311) 
AAGAAGACAA (SEQ ID NO: 3 13) 
GAAGACAAGA (SEQ ID NO: 315) 
AGACAAGATA (SEQ ID NO: 317) 
ACAAGATATC (SEQ ID NO: 3 19) 
AAGATATCCT (SEQ ID NO: 321) 
GATATCCTTG (SEQ ID NO: 323) 
TATCCTTGAT (SEQ ID NO: 325) 
TCCTTGATCT (SEQ ID NO: 327) 
CTTGATCTGT (SEQ ID NO: 329) 
TGATCTGTGG (SEQ ID NO: 331) 
ATCTGTGGAT (SEQ ID NO: 333) 
CTGTGGATCT ( SEQ ID NO : 3 3 5 ) 
GTGGATCTAC (SEQ ID NO: 337) ; 
GGATCTACCA (SEQ ID NO: 339) ; 

ATCTACCACA (SEQ ID NO: 341) ; 
CTACCACACA (SEQ ID NO: 343) ; 

ACCACACACA (SEQ ID NO: 345) j 
CACACACAAG (SEQ ID NO:347) j 

CACACAAGGC (SEQ ID NO: 34 9) j 

CACAAGGCTA (SEQ ID NO: 351) ; 

CAAGGCTACT (SEQ ID NO:353); 

AGGCTACTTC ( SEQ ID NO : 3 5 5 ) ; 

GCTACTTCCC (SEQ ID NO: 357) ; 

TACTTCCCTG (SEQ ID NO: 359) ; 

CTTCCCTGAT (SEQ ID NO: 3 61) ; 

TCCCTGATTG (SEQ ID NO: 3 63) ; 

CCTGATTGGC (SEQ ID NO: 365) ; 

TGATTGGCAG (SEQ ID NO: 367) ; 

ATTGGCAGAA (SEQ ID NO:369) / 

TGGCAGAACT (SEQ ID NO:371) ; 

GCAGAACTAC (SEQ ID NO: 3 73) ; 

AGAACTACAC (SEQ ID NO:375) ; 
.AACTACACAC (SEQ ID NO:377) ; 

CTACACACCA (SEQ ID NO:379) ; 

ACACACCAGG (SEQ ID NO:381) ; 

ACACCAGGGC (SEQ ID NO:383) ; 

ACCAGGGCCA (SEQ ID NO: 3 85) ; 
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CCAGGGCCAG (SEQ ID NO: 386) 
AGGGCCAGGG (SEQ ID NO: 388) 
GGCCAGGGGT (SEQ ID NO: 390) 
CCAGGGGTCA (SEQ ID NO: 3 92) 
AGGGGTCAGA (SEQ ID NO: 3 94) 
GGGTCAGATA (SEQ ID NO: 396) 
GTCAGATATC (SEQ ID NO: 398) 
CAGATATCCA (SEQ ID NO: 400) 
GATATCCACT (SEQ ID NO: 402) 
TATCCACTGA (SEQ ID NO: 4 04) 
TCCACTGACC (SEQ ID NO: 406) 
CACTGACCTT (SEQ ID NO:408) 
CTGACCTTTG (SEQ ID NO: 4 10) 
GACCTTTGGA (SEQ ID NO: 4 12) 
CCTTTGGATG (SEQ ID NO: 4 14) 
TTTGGATGGT (SEQ ID NO: 416) 
TGGATGGTGC ( SEQ ID NO: 418) ■ 
GATGGTGCTA (SEQ ID NO: 420) ; 
TGGTGCTACA (SEQ ID NO: 422) \ 
GTGCTACAAG (SEQ ID NO: 424) j 
GCTACAAGCT (SEQ ID NO:426) ; 
TACAAGCTAG (SEQ ID NO: 428) ; 
CAAGCTAGTA (SEQ ID NO: 430) ; 
AGCTAGTACC (SEQ ID NO: 432) ; 
CTAGTACCAG (SEQ ID NO: 434) ; 

AGTACCAGTT (SEQ ID NO:436) ; 
TACCAGTTGA (SEQ ID NO:438) ; 
CCAGTTGAGC (SEQ ID NO: 440) ; 

AGTTGAGCCA (SEQ ID NO: 442) ; 

TTGAGCCAGA (SEQ ID NO: 444) ; 

GAGCCAGATA (SEQ ID NO: 44 6) ; 

GCCAGATAAG (SEQ ID NO: 44 8) ; 

CAGATAAGGT (SEQ ID NO: 450) ; 

GATAAGGTAG (SEQ ID NO: 452) ; 

TAAGGTAGAA (SEQ ID NO: 454) ; 

AGGTAGAAGA (SEQ ID NO: 456) ; 

GTAGAAGAGG (SEQ ID NO: 458) ; 

AGAAGAGGCC (SEQ ID NO:460) ; 
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CAGGGCCAGG (SEQ ID NO: 387) 
GGGCCAGGGG (SEQ ID NO: 389) 
GCCAGGGGTC (SEQ ID NO: 3 91) 
CAGGGGTCAG (SEQ ID NO: 3 93) 
GGGGTCAGAT (SEQ ID NO: 395) 
GGTCAGATAT (SEQ ID NO: 397) 
TCAGATATCC (SEQ ID NO: 399) 
AGATATCCAC (SEQ ID NO:401) 
ATATCCACTG (SEQ ID NO: 403) 
ATCCACTGAC (SEQ ID NO: 4 05) 
CCACTGACCT (SEQ ID NO: 407) 
ACTGACCTTT (SEQ ID NO:409) 
TGACCTTTGG (SEQ ID NO: 411) 
ACCTTTGGAT (SEQ ID NO: 4 13) 
CTTTGGATGG (SEQ ID NO: 415) 
TTGGATGGTG (SEQ ID NO: 417) 
GGATGGTGCT (SEQ ID NO:419) 
ATGGTGCTAC (SEQ ID NO:421) 
GGTGCTACAA (SEQ ID NO: 423) 
TGCTACAAGC (SEQ ID NO: 425) , 
CTACAAGCTA (SEQ ID NO: 427) ; 
ACAAGCTAGT (SEQ ID NO: 429) , 
AAGCTAGTAC (SEQ ID NO:431) ; 
GCTAGTACCA (SEQ ID NO: 433) ; 
TAGTACCAGT (SEQ ID NO: 435) ; 
GTACCAGTTG (SEQ ID NO: 437) ; 
ACCAGTTGAG (SEQ ID NO:439) ; 
CAGTTGAGCC (SEQ ID NO: 441) ; 
GTTGAGCCAG (SEQ ID NO: 443) ; 
TGAGCCAGAT (SEQ ID NO: 44 5) ; 
AGCCAGATAA (SEQ ID NO: 447) ; 
CCAGATAAGG (SEQ ID NO: 44 9) ; 
AGATAAGGTA (SEQ ID NO: 451) ; 
. ATAAGGTAGA (SEQ ID NO: 453) ; 
AAGGTAGAAG (SEQ ID NO: 455) ; 
GGTAGAAGAG (SEQ ID NO:457) ; 
TAGAAGAGGC (SEQ ID NO:459) ; 
GAAGAGGCCA (SEQ ID NO: 461) ; 
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AAGAGGCCAA (SEQ ID NO:462) 
GAGGCCAATA (SEQ ID NO: 4 64) 
GGCCAATAAA (SEQ ID NO: 466) 
CCAATAAAGG (SEQ ID NO; 468) 
AATAAAGGAG (SEQ ID NO: 470) 
TAAAGGAGAG (SEQ ID NO: 472) 
AAGGAGAGAA (SEQ ID NO: 4 74) 
GGAGAGAACA (SEQ ID NO: 476) 
AGAGAACACC (SEQ ID NO:478) 
AGAACACCAG (SEQ ID NO: 480) 
AACACCAGCT (SEQ ID NO: 4 82) 
CACCAGCTTG (SEQ ID NO: 4 84) 
CCAGCTTGTT (SEQ ID NO: 486) 
AGCTTGTTAC (SEQ ID NO: 488) 
CTTGTTACAC (SEQ ID NO:490) 
TGTTACACCC (SEQ ID NO: 4 92) 
TTACACCCTG (SEQ ID NO: 4 94) 
ACACCCTGTG (SEQ ID NO: 496) ( 
ACCCTGTGAG (SEQ ID NO:498) , 
CCTGTGAGCC (SEQ ID NO: 500) ; 
TQTGAGCCTG (SEQ ID NO: 5 02); 
TGAGCCTGCA (SEQ ID NO: 504) ; 
AGCCTGCATG (SEQ ID NO:506) ; 
CCTGCATGGA (SEQ ID NO:508) ; 
TGCATGGAAT (SEQ ID NO: 510); 
CATGGAATGG (SEQ ID NO: 512); 
TGGAATGGAT (SEQ ID NO: 514) ; 
GAATGGATGA (SEQ ID NO: 516) ; 
ATGGATGACC (SEQ ID NO: 518) ; 
GGATGACCCT (SEQ ID NO:520); 
ATGACCCTGA (SEQ ID NO: 522) ; 
GACCCTGAGA (SEQ ID NO: 524) ; 
CCCTGAGAGA (SEQ ID NO: 526) ; 
CTGAGAGAGA (SEQ ID NO: 528); 
GAGAGAGAAG (SEQ ID NO: 530); 
GAGAGAAGTG (SEQ ID NO:532); 
GAGAAQTGTT (SEQ ID NO: 534); 
GAAGTGTTAG (SEQ ID NO:536); 
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AGAGGCCAAT (SEQ ID NO:463) ; 

AGGCCAATAA (SEQ ID NO:465) ; 

GCCAATAAAG (SEQ ID NO:467) ; 

CAATAAAGGA (SEQ ID NO:469) ; 

ATAAAGGAGA (SEQ ID NO:471) ; 

AAAGGAGAGA (SEQ ID NO: 473) ; 

AGGAGAGAAC (SEQ ID NO: 475) ; 

GAGAGAACAC (SEQ ID NO:477) ; 

GAGAACACCA (SEQ ID NO: 479) ; 

GAACACCAGC (SEQ ID NO:481) ; 

ACACCAGCTT (SEQ ID NO: 483) ; 

ACCAGCTTGT (SEQ ID NO: 485) ; 

CAGCTTGTTA (SEQ ID NO: 4 87) ; 

GCTTGTTACA (SEQ ID NO:489) ; 

TTGTTACACC (SEQ ID NO:491) ; 

GTTACAGCCT (SEQ ID NO: 4 93) ; 
TACACCCTGT (SEQ ID NO : 495) ; 

CACCCTGTGA (SEQ ID NO:497) ; 
CCCTGTGAGC ( SEQ ID NO : 4 9 9 ) ;. 
CTGTGAGCCT (SEQ ID NO : 501) ; 
GTGAGCCTGC (SEQ ID NO: 503) ; 
GAGCCTGCAT (SEQ ID NO: 505) ; 
GCCTGCATGG (SEQ ID NO: 507) ; 
CTGCATGGAA (SEQ ID NO: 509) ; 
GCATGGAATG (SEQ ID NO: 511) ; 
ATGGAATGGA (SEQ ID NO: 513) ; 
GGAATGGATG (SEQ ID NO:515); 
AATGGATGAC (SEQ ID NO: 517) ; 
TGGATGACCC (SEQ ID NO: 519) ; 
GATGACCCTG (SEQ ID NO: 521) ; 
TGACCCTGAG (SEQ ID NO: 523) ; 
ACCCTGAGAG (SEQ ID NO:525); 
CCTGAGAGAG (SEQ ID NO: 527) ; 
TGAGAGAGAA (SEQ ID NO:529); 
AGAGAGAAGT (SEQ ID NO:531); 
AGAGAAGTGT (SEQ ID NO:533); 
AGAAGTGTTA (SEQ ID NO: 535) ; 
AAGTGTTAGA (SEQ ID NO: 537) ; 
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AGTGTTAGAG (SEQ ID NO: 53 8) 
TGTTAGAGTG (SEQ ID NO: 540) 
TTAGAGTGGA (SEQ ID NO: 542) 
AGAGTGGAGG (SEQ ID NO: 544) 
AGTGGAGGTT (SEQ ID NO: 54 6) 
TGGAGGTTTG (SEQ ID NO: 548) 
GAGGTTTGAC (SEQ ID NO: 550) 
GGTTTGACAG (SEQ ID NO: 552) 
TTTGACAGCC (SEQ ID NO: 554) 
TGACAGCCGC (SEQ ID NO: 556) 
ACAGCCGCCT (SEQ ID NO: 558) 
AGCCGCCTAG (SEQ ID NO: 560) 
CCGCCTAGCA (SEQ ID NO: 562) 
GCCTAGCATT (SEQ ID NO: 564) 
CTAGCATTTC (SEQ ID NO: 566) 
AGCATTTCAT (SEQ ID NO:568) 
CATTTCATCA . (SEQ ID NO: 570) 
TTTCATCACG (SEQ ID NO: 572) 
TCATCACGTG (SEQ ID NO: 574) 
ATCACGTGGC . (SEQ ID NO : 576 ) 
CACGTGGCCC. (SEQ ID NO: 578) 
CGTGGCCCGA (SEQ ID NO: 580) 
TGGCCCGAGA (SEQ ID NO:582) , 
GCCCGAGAGC (SEQ ID NO: 584) ; 
CCGAGAGCTG ( SEQ ID NO : 5 8 6 ) j 
GAGAGCTGCA (SEQ ID NO: 588) j 
GAGCTGCATC (SEQ ID NO: 590) ; 
GCTGCATCCG (SEQ ID NO: 592) ; 
TGCATCCGGA (SEQ ID NO: 5 94); 
CATCCGGAGT (SEQ ID NO:596) ; 
TCCGGAGTAC (SEQ ID NO: 598) ; 
CGGAGTACTT ( SEQ ID NO: 600) ; 
GAGTACTTCA (SEQ ID NO: 602); 
GTACTTCAAG (SEQ ID NO: 604); 
ACTTCAAGAA (SEQ ID NO:606) ; 
TTCAAGAACT (SEQ ID NO:608) ; 
CAAGAACTGC (SEQ ID NO:610) ; 
AGAACTGCTG (SEQ ID NO: 612) ; 



GTGTTAGAGT 


(SEQ 


ID 


NO 


:539) 


GTTAGAGTGG 


(SEQ 


ID 


NO 


:541) 


TAGAGTGGAG 


(SEQ 


ID 


NO 


:543) 


GAGTGGAGGT 


(SEQ 


ID 


NO 


:545) 


GTGGAGGTTT 


(SEQ 


ID 


NO 


:547) 


GGAGGTTTGA 


(SEQ 


ID 


NO 


:549) 


AGGTTTGACA 


(SEQ 


ID 


NO 


:551) 


GTTTGACAGC 


(SEQ 


ID 


NO 


:553) 


TTGACAGCCG 


(SEQ 


ID 


NO 


:555) 


GACAGCCGCC 


(SEQ 


ID 


NO 


:557) 


CAGCCGCCTA 


(SEQ 


ID 


NO 


:559) 


GCCGCCTAGC 


(SEQ 


ID 


NO 


:561) 


CGCCTAGCAT 


(SEQ 


ID 


NO 


:563) 


CCTAGCATTT 


(SEQ 


ID 


NO: 


•565) 


TAGCATTTCA 


(SEQ 


ID 


NO: 


:567) 


GCATTTCATC 


(SEQ 


ID 


NO: 


569) 


ATTTCATCAC 


(SEQ 


ID 


NO: 


571) 


TTCATCACGT 


(SEQ 


ID 


NO: 


573) 


CATCACGTGG 


(SEQ 


ID 


NO: 


575) 


TCACGTGGCC 


(SEQ 


ID 


NO: 


577) 


ACGTGGCCCG 


(SEQ 


ID 


NO: 


579) , 


GTGGCCCGAG 


(SEQ 


ID 


NO: 


581) , 


GGCCCGAGAG 


(SEQ 


ID 


NO: 


583) , 


CCCGAGAGCT 


(SEQ 


ID 


NO: 


585) , 


CGAGAGCTGC 


(SEQ 


ID 


NO: 


587) j 


AGAGCTGCAT 


(SEQ 


ID 


NO: 


589) ; 


AGCTGCATCC 


(SEQ 


ID 


NO: 


591) j 


CTGCATCCGG 


(SEQ 


ID 


NO: 


593) ; 


GCATCCGGAG 


(SEQ 


ID 


NO: 


595) ; 


ATCCGGAGTA 


(SEQ 


ID 


NO: 


597) ; 


CCGGAGTACT 


(SEQ 


ID 


NO: 


599) ; 


GGAGTACTTC 


(SEQ 


ID 


NO: 


601) ; 


AGTACTTCAA 


(SEQ 


ID 


NO: 


603) ; 


TACTTCAAGA 


(SEQ 


ID 


NO: 


605) ; 


CTTCAAGAAC 


(SEQ 


ID 


NO: 


607) ; 


TCAAGAACTG 


(SEQ 


ID 


NO: 


609) ; 


AAGAACTGCT 


(SEQ 


ID 


NO: 


611) ; 


GAACTGCTGA 


(SEQ 


ID 


NO: 


613) . 
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92. A method according to claim 86 wherein said deletion encompasses at least one 
of the following decanucleotides from the LTR region of HIV- 1 strain NL4-3: 



GCTTTTTGCC 


(SEQ 


ID NO: 652) ; 


CTTTTTGCCT 


(SEQ 


ID NO: 653) ; 


TTTTTGCCTG 


(SEQ 


ID NO: 654) ; 


TTTTGCCTGT 


(SEQ 


ID NO: 655) ; 


TTTGCCTGTA 


(SEQ 


ID NO: 656) ; 


TTGCCTGTAC 


(SEQ 


ID NO: 657) ; 


TGCCTGTACT 


(SEQ 


ID NO: 658) ; 


GCCTGTACTG 


(SEQ 


ID NO: 659) / 


CCTGTACTGG 


(SEQ 


ID NO: 660) ; 


CTGTACTGGG 


(SEQ 


ID NO: 661) ; 


TGTACTGGGT 


(SEQ 


ID NO: 662) ; 


GTACTGGGTC 


(SEQ 


ID NO: 663) ; 


TACTGGGTCT 


(SEQ 


ID NO: 664) ; 


ACTGGGTCTC 


(SEQ 


ID NO: 665) ; 


CTGGGTCTCT 


(SEQ 


ID NO: 666) ; 


TGGGTCTCTC 


(SEQ 


ID NO: 667) ; 


GGGTCTCTCT 


(SEQ 


ID NO: 668) ; 


GGTCTCTCTG 


(SEQ 


ID NO: 669) ; 


GTCTCTCTGG 


(SEQ 


ID NO: 670) ; 


TCTCTCTGGT 


(SEQ 


ID NO: 671) ; 


CTCTCTGGTT 


(SEQ 


ID NO: 672) ; 


TCTCTGGTTA 


(SEQ 


ID NO: 673) ; 


CTCTGGTTAG 


(SEQ 


ID NO: 674) ; 


TCTCTGGTTA 


(SEQ ID NO: 675) ; 


CTGGTTAGAC 


(SEQ 


ID NO: 676) ; 


TGGTTAGACC 


(SEQ 


ID NO: 677) ; 


GGTTAGACCA 


(SEQ 


ID NO: 678) ; 


GTTAGACCAG 


(SEQ 


ID NO: 679) ; 


TTAGACCAGA 


(SEQ 


ID NO: 680) ; 


TAGACCAGAT 


(SEQ 


ID NO: 681) ; 


AGACCAGATC 


(SEQ 


ID NO: 682) ; 


GACCAGATCT 


(SEQ 


ID NO: 683) ; 


ACCAGATCTG 


(SEQ 


ID NO: 684) ; 


CCAGATCTGA 


(SEQ 


ID NO: 685) ; 


CAGATCTGAG 


(SEQ 


ID NO: 686) ; 


AGATCTGAGC 


(SEQ 


ID NO: 687) ; 


GATCTGAGCC 


(SEQ 


ID NO: 688) ; 


ATCTGAGCCT 


(SEQ 


ID NO: 689) ; 


TCTGAGCCTG 


(SEQ 


ID NO: 690) ; 


CTGAGCCTGG 


(SEQ 


ID NO: 691) ; 


TGAGCCTGGG 


(SEQ 


ID NO: 692) ; 


GAGCCTGGGA 


(SEQ 


ID NO: 693) ; 


AGCCTGGGAG 


(SEQ 


ID NO: 694) ; 


GCCTGGGAGC 


(SEQ 


ID NO: 695) ; 


CCTGGGAGCT 


(SEQ 


ID NO: 696) ; 


CTGGGAGCTC 


(SEQ 


ID NO: 697) ; 


TGGGAGCTCT 


(SEQ 


ID NO: 698) ; 


GGGAGCTCTC 


(SEQ 


ID NO: 699) ; 


GGAGCTCTCT 


(SEQ 


ID NO: 700) ; 


GAGCTCTCTG 


(SEQ 


ID NO: 701) ; 


AGCTCTCTGG 


(SEQ 


ID NO: 702) ; 


GCTCTCTGGC 


(SEQ 


ID NO: 703) / 


CTCTCTGGCT 


(SEQ 


ID NO: 704) ; 


TCTCTGGCTA 


(SEQ 


ID NO: 705) ; 


CTCTGGCTAA 


(SEQ 


ID NO: 706) ; 


TCTGGCTAAC 


(SEQ 


ID NO: 707) ; 


CTGGCTAACT 


(SEQ 


ID NO: 708) ; 


TGGCTAACTA 


(SEQ 


ID NO: 709) ; 


GGCTAACTAG 


(SEQ 


ID NO: 710) ; 


GCTAACTAGG 


(SEQ 


ID NO: 711) ; 


CTAACTAGGG . 


(SEQ 


ID NO: 712) ; 


TAACTAGGGA 


(SEQ 


ID NO: 713) / 


AACTAGGGAA 


(SEQ 


ID NO: 714) ; 


ACTAGGGAAC 


(SEQ 


ID NO: 715) ; 


CTAGGGAACC 


(SEQ 


ID NO: 716) ; 


TAGGGAACCC 


(SEQ 


ID NO: 717) ; 


AGGGAACCCA 


(SEQ 


ID NO: 718) ; 


GGGAACCCAC 


(SEQ 


ID NO: 719) ; 
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GGAACCCACT 


(SEQ 


ID 


NO 


:720) 


AACCCACTGC 


(SEQ 


ID 


NO 


:722) 


CCCACTGCTT 


(SEQ 


ID 


NO 


:724) 


CACTGCTTAA 


(SEQ 


ID 


NO 


:726) 


CTGCTTAAGC 


(SEQ 


ID 


NO 


:728) 


G'CTTAAGCCT 


(SEQ 


ID 


NO 


:730) 


TTAAGCCTCA 


(SEQ 


ID 


NO 


:732) 


AAGCCTCAAT 


(SEQ 


ID 


NO 


:734) 


GCCTCAATAA 


(SEQ 


ID 


NO 


:736) 


CTCAATAAAG 


(SEQ 


ID 


NO 


:738) 


CAATAAAGCT 


(SEQ 


ID 


NO 


:740) 


ATAAAGCTTG 


(SEQ 


ID 


NO 


:742) 


AAAGCTTGCC 


(SEQ 


ID 


NO 


:744) 


AGCTTGCCTT 


(SEQ 


ID 


NO 


:746) 


CTTGCCTTGA 


(SEQ 


ID 


NO 


:748) 


TGCCTTGAGT 


(SEQ 


ID 


NO: 


t750) 


CCTTGAGTGC 


(SEQ 


ID 


NOs 


752) 


TTGAGTGCTT 


(SEQ 


ID 


NO: 


754) 


GAGTGCTTCA 


..(SEQ 


ID 


NO: 


756) 


GTGCTTCAAG 


(SEQ 


ID 


NO: 


758) 


GCTTCAAGTA 


(SEQ 


IP 


NO: 


760) 


TTCAAGTAGT 


(SEQ 


ID 


NO: 


762) 


CAAGTAGTGT 


(SEQ 


ID 


NO: 


764) 


AGTAGTGTGT 


(SEQ 


ID 


NO: 


766) 


TAGTGTGTGC 


(SEQ 


ID 


NO: 


768) 


GTGTGTGCCC 


(SEQ 


ID 


NO: 


770) 


GTGTGCCCGT 


(SEQ 


ID 


NO: 


772) 


GTGCCCGTCT 


(SEQ 


ID 


NO: 


774) 


GCCCGTCTGT 


(SEQ 


ID 


NO: 


776) 


CCGTCTGTTG 


(SEQ 


ID 


NO: 


778) 


GTCTGTTGTG 


(SEQ 


ID 


NO: 


780) 


CTGTTGTGTG 


(SEQ 


ID 


NO: 


782) 


GTTGTGTGAC 


(SEQ 


ID 


NO: 


784) 


TGTGTGACTC 


(SEQ 


ID 


NO: 


786) 


TGTGTGACTC 


(SEQ 


ID 


NO: 


788) 


TGTGACTCTG 


(SEQ 


ID 


NO: 


790) 
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GAACCCACTG 


(SEQ 


ID 


NO 


:721) 


ACCCACTGCT 


(SEQ 


ID 


NO 


:723) 


CCACTGCTTA 


(SEQ 


ID 


NO 


:725) 


ACTGCTTAAG 


(SEQ 


ID 


NO 


:727) 


TGCTTAAGCC 


(SEQ 


ID 


NO 


:729) 


CTTAAGCCTC 


(SEQ 


ID 


NO 


:731) 


TAAGCCTCAA 


(SEQ 


ID 


NO 


:733) 


AGCCTCAATA 


(SEQ 


ID 


NO 


:735) 


CCTCAATAAA 


(SEQ 


ID 


NO 


:737) 


TCAATAAAGC 


(SEQ 


ID 


NO 


:739) 


AATAAAGCTT 


(SEQ 


ID 


NO 


:741) 


TAAAGCTTGC 


(SEQ 


ID 


NO 


:743) 


AAGCTTGCCT 


(SEQ 


ID 


NO 


:745) , 


GCTTGCCTTG 


(SEQ 


ID 


NO: 


•747) , 


TTGCCTTGAG 


(SEQ 


ID 


NO: 


749) ; 


GCCTTGAGTG 


(SEQ 


ID 


NO: 


751) j 


CTTGAGTGCT 


(SEQ 


ID 


NO: 


753) ; 


TGAGTGCTTC 


(SEQ 


ID 


NO: 


755) ; 


AGTGCTTCAA 


(SEQ 


ID 


NO: 


757) ; 


TGCTTCAAGT 


(SEQ 


ID 


NO: 


759) ; 


CTTCAAGTAG 


(SEQ 


ID 


NO: 


761) ; 


TCAAGTAGTG 


(SEQ 


ID 


NO: 


763) ; 


AAGTAGTGTG 


(SEQ 


ID 


NO: 


765) ; 


GTAGTGTGTG 


(SEQ 


ID 


NO: 


767) ; 


AGTGTGTGCC 


(SEQ 


ID 


NO: 


769) ; 


TGTGTGCCCG 


(SEQ 


ID 


NO: 


771) ; 


TGTGCCCGTC 


(SEQ 


ID 


NO: 


773) ; 


TGCCCGTCTG 


(SEQ 


ID 


NO: 


775) ; 


CCCGTCTGTT 


(SEQ 


ID 


NO: 


777) ; 


CGTCTGTTGT 


(SEQ 


ID 


NO: 


779) ; 


TCTGTTGTGT 


(SEQ 


ID 


NO: 


781) ; 


TGTTGTGTGA 


(SEQ 


ID 


NO: 


783) ; 


TTGTGTGACT 


(SEQ 


ID 


NO: 


785) ; 


GTGTGACTCT 


(SEQ 


ID 


NO: 


787) ; 


GTGTGACTCT 


(SEQ 


ID 


NO: 


789) ; 


GTGACTCTGG 


(SEQ 


ID 


NO: 


791) ; 
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TGACTCTGGT (SEQ ID NO: 792); GACTCTGGTA (SEQ ID NO: 793) ; 

ACTCTGGTAA (SEQ ID NO: 794) ; CTCTGGTAAC (SEQ ID NO: 795); 

TCTGGTAACT (SEQ ID NO: 796); CTGGTAACTA (SEQ ID NO: 797) ; 

TGGTAACTAG (SEQ ID NO:798) ; GGTAACTAGA (SEQ ID NO:799). 
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TGGAAGGGCTAATTCACTCACGGAAAAGACCAGTTGAACCAG 

AAGAAGATAGAAGAGGCCATGAAGAAGAAAACAACAGATTGT 

TCTGCTTGCTCAGCTGGGGACTTTCCAGAAGGCGCGGCCTGA 

GTGACTAAGCCCCGTTGGGGACTTTCCGAAGAGGCATGAAGG 

GACTTTCCAAGGCAGGCGTGGCCTGGGCGGGACTGGGGAGTG 

GCGAGCCCTCAGATGCTGCATATAAGCAGCTGCTTTCTGCCT 

GTACTGGGTCTCTCTGGTTAGACCAGATCTGAGCCTGGGAGC 

TCTCTGGCTAGCTAGGGAACCCACTGCTTAAGCCTCAATAAA 

GCTTGCCTTGAGTGCTTCAAGTAGTGTGTGCCCGTCTGTTGT 

GTGACTCTGGTATCTAGAGATCCCTCAGACCATTTTAGTCCG 

TGTGGAAAATCTCTAGCAGTGGCGCCCGAACAGGGACTTGAA 

AGCGAAAGGAAAACCAGAGGAGCTCTCTCGACGCAGGACTCG 

GCTTGCTGAAGCGCGCACGGCAAGAGGCGAGGGGCGGCGACT 

GGTGAGTACGCCGAAAATTTTGACTAGCGGAGGCTAGAAGGA 

GAGAGATGGGTGCGAGAGCGTCAATATTAAGCGGGGGAAAAT 

TAGATAGATGGGAGAAAATTCGGTTAAGGCCAGGAGGAAAGA 

AAAAGTATAAATTAAAACATATAGTATGGGCAAGCAGGGAGC * 

TAGAACGATTCGCAGTCAATCCTGGCCTGTTGGAAACATCAG 

AAGGCTGTAGACAAATACTGGGACAGTTACACCCGTCCCTTC ; 

AGACAGGATCAGAAGAACTTAAATCAGTATATAATGCAGTAG 

CAGTCCTCTATTGTGTGCATCAAAACATAGACTATAAAGGACA 

CCAAGGAAGCTTTAGAAAAGATAGAGGAAGAGCAAAACAAAT 

GTAAGAAAAAAGCACAGCAAGCAGCAGCACAGCAAGCAGCAG 

CTGGCACAGGAAACAGCAACCCGGTCAGCCAAAATTACCCTA 

TAGTACAGAACATGCAGGGGCAAATGGTACATCAGGCCATAT 

CACCTAGAACTTTAAATGCATGGGTAAAAGTAATAGAAGAGA 

AGGCTTTCAGCCCAGAGGTAATACCCATGTTTTCAGCATTAT 

CAGAAGGAGCCAC CCCACAAGATTTAAACACCATGCTAAACA 

CAGTGGGGGGACATCAAGCAGCTATGCAAATGTTAAAAGAGA 

CCATCAATGAGGAAGCTGCAGAATGGGATAGATTACATCCAG 

CGCAGGCAGGGCCTGTTGCACCAGGCCAGATGAGAGACCCAA 

GGGGAAGTGACATAGCAGGAACTACTAGTACCCTTCAGGAAC 

AAATAGGATGGATGACAGGTAATCCAGCTATCCCAGTAGGAG 

AAATCTATAAAAGATGGATAATCCTGGGATTAAATAAAATAG 

TAAGGATGTATAGCCCTATCAGCATTCTGGACATAAAACAAG 

GACCAAAGGAACCCTTTAGAGACTATGTAGACCGGTTCTATA 

AAACTCTAAGAGCCGAGCAAGCTACACAGGAGGTAAAAAATT 

GGATGACAGAAACCTTGTTGGTCCAAAATGCAAACCCAGATT 

GTAAGACTATTTTAAAAGCATTGGGACCAGCAGCTACACTAG 
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AAGAAATGATGACAGCATGTCAGGGAGTGGGAGGACCCAGCC 

ATAAAGCAAGAGTTTTGGCAGAAGCAATGAGCCAAGCAACAA 

ATGCAGCTACTGTAATGATGCAGAGAAGCAATTTTAGAAACC 

AAAGAAAGAATGTTAAGTGTTTCAATTGTGGCAAAGAAGGGC 

ACATAGCCAGAAATTGCAGGGCTCCTAGGAAAAGGGGCTGTT 

GGAAATGTGGAAAGGAAGGACACCAAATGAAAGATTGTACTG 

AGAGACAGGCTAATTTTTTAGGGAAAATCTGGCCTTCCCACA 

AGGGGAGGCCAGGGAACTTTCTTCAGAGCAGGCCAGAACCAA 

CAGCCCCTCTCCAGGGCAGGCCGGAGCCATCAGCCCCGCCAG 

AAGAGAGCTTCAGGTTTGGGGAGGAGACAACAACTCCCTCTC 

AGAAGCAGGAGCCGATAGACAGGGACAGGGATCTGTATCCTT 

TAGCTTCCCTCAGATCACTCTTTGGCAACGACCCCTCGTCAC 

AATAAAGATAGGGGGGCAGCTGAAGGAAGCTCTATTAGATAC 

AGGAGCAGATGATACAGTATTAGAAGACATGCATTTGCCAGG 

AAAATGGAAACCAAAAATGATAGGGGGAATTGGAGGTTTTAT. 

CAAAGTAAAACAATATGATGAAATTCTTGTAGAAATCTGTGG 

ACATAAAGCTATAGGTACAGTATTAGTAGGACCTACACCTGT 

CAACATAATTGGAAGAAATCTGTTGACTCAGATTGGTTGCAC ■ 

TTTAAATTTTCCCATTAGTCCTATTGAAACTGTACCAGTACA 

ATTAAAGCCAGGAATGGATGGCCCAAAGGTTAAAGAATGGCC 

ATTGACAGAAGAGAAAATAAAAGCATTAGTAGAAATTTGTAC 

AGAAATGGAAAAGGAAGGAAAGATTTCAAAAATTGGGCCTGA 

AAATCCATACAATACTCCAGTATTTGCCATAAAGAAAAAAGA 

TGGTACTAAATGGAGAAAATTAGTAGATTTCAGAGACCTTAA 

TAAGAGAACTCAAGACTTCTGGGAAGTTCAATTAGGAATACC 

ACATCCCTCAGGATTAAAAAAGAAAAAATCAGTAACAGTACT 

GGATGTGGGTGATGCATACTTTTCAGTTCCCTTAGATGAAAA 

CTTCAGGAAGTATACTGCATTTACCATACCTAGTATAAATAA 

TGAGACACCAGGGATTAGATATCAGTACAATGTGCTTCCACA 

GGGATGGAAAGGATCACCAGCAATATTCCAAAGTAGCATGAC 

AAGAATCTTAGAGCCTTTTAGAAGACAAAATCCAGACATAGT 

TATCTATCAATACATGGATGACTTGTATGTAGGATCTGATTT 

AGAAATAGGACAGCATAGAATAAAAATAGAGGAACTGAGACA 

ACATCTGTTGAAGTGGGGATTTACCACACCAGACAAAT^AGCA 

TCAGAAAGAACCCCCATTCCTTTGGATGGGTTATGAACTCCA 

TCCTGATAAATGGACAGTGCAACCTATAGTACTGCCAGAAAA 

AGACAGCTGGACTGTCAATGACATACAGAAGTTAGTGGGTAA 

ATTAAATTGGGCAAGTCAGATTTACCCAGGAATTAAAGTAAG 

GCAATTATGTAAACTCCTTAGGGGAACCAAAGCACTAACAGA 
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AGTAATACCACTAACAGAAGAAGCAGAGCTAGAACTGGCAGA 

AAACAGGGAAATTCTAAGAGAACCAGTACATGGAGTGTATTA 

TGACCCATCAAAAGACTTAATAGCAGAAATACAGAAGCAGGA 

GCAAGGCCAATGGACATATCAAATTTATCAAGATCAATTTAA 

AAATCTAAAAACAGGAAAGTATGCAAGATTGAGGGGTGCC CA 

CACTAATGATGTAAAACAATTTCCAGAGGCAGTGCAAAAAAT 

AGCCACAGAAAGCATAGTAATATGGGGAAAGACTCCTAAATT 

TAGACTACCCATACAAAAAGAAACATGGGACGCATGGTGGAC 

AGAGTATTGGCAAGCCACCTGGATTCCTGAGTGGGAGTTTGT 

CAATACCCGTCCCCTAGTAAAATTATGGTACCAGTTAGAAAA 

AGAACCCATAATAGGAGCAGAAACTTTCTATGTAGATGGGGC 

AGCTAACAGAGAGACTAAATTAGGAAAAGCAGGATATGTTAC 

TGACAGAGGAAGACAAAAAGTTGTCTCCCTAACTGACACAAC 

AAATCAGAAGACTGAGTTACAAGCAATTCATCTAGCTTTGCA 

GGATTCAGGATTAGAAGTAAACATAGTAACAGACTCACAGTA 

TGCATTAGGAATCATTCAAGCACAACCAGATAAAAGTGAATC 

AGAAATAGTCAATCAAATAATAGAGCAATTAATAAAAAAGGA 

AAAGGTCTACCTGGCATGGGTACCAGCAGAGAAAGGAATTGG ; 

AGGGAATGAACAAGTAGATAAATTAGTCAGTGCTGGAATCAG ? 

GAAAATACTATTTTTAGATGGAATAGATAAGGCACAAGAAGG 

CCATGAGAAATATCACAGTAATTGGAGAGCAATGGCTAGTGG 

TTTTAACCTGCCACCTATAGTAGCAAAAGAAATAGTAGCCAG 

CTGTGATAAATGTCAGCTAAAAGGAGAAGCCATGCATGGACA 

AGTAGACTGTAGTCCAGGAATATGGCAACTAGATTGTACACA 

TCTAGAAGGAAAAATTATCCTGGTAGCAGTTCATGTAGCCAG 

TGGATATATAGAAGCAGAAGTTATTCCAGCAGAGACAGGGCA 

GGAAACAGCATACTTTATCTTAAAATTAGCAGGAAGGTGGCC 

AGTAAACACAATACATACAGACAATGGCGGCAATTTCATCAG 

TACCACGGTTAAGGCCGCCTGTTGGTGGGCAGGGATCAAGCA 

GGAATTTGGCATTCCCTACAATCCCCAAAGCCAAGGAGTAGT 

GGAATCTATGAATAGAGAATTAAAGAAAATTATAGGACAGGT 

AAGAGATCAGGCTGAACATCTTAAGACAGCAGTACAAATGGC 

AGTATTCATCCACAATTTTAAAAGAAAAGGGGGGATTGGGGG 

ATACAGTGCAGGGGAAAGAATAGTAGACATAATAGCAACAGA 

CATACAAACTAAAGAATTACAAAAGCAAATTACAAAAATTCA 

AAATTTTCGGGTTTATTACAGGGACAGCAGAGATCCACTTTG 

GAAAGGACCAGCAAAACTTCTCTGGAAAGGCGAAGGGGCAGT 

AGTAATACAAGATAATAGTGACATAAAAGTAGTGCCAAGAAG 

AAAAGTAAAGATCATTAGGGATTATGGAAAACAGATGGCAGG 
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TGATGATTGTGTGGCAAGTAGACAGGATGAGGATTAGAACAT 
GGAACAGTTTAGTGAAACACCATATGTATGTTTCAAAGAAAG 
CTAAGGGATGGATTTATAGACATCACTATGAAAACACTCATC 
CAAAAATAAGCTCAGAAGTACACATCCCACTAGGGGAAGCTA 
GATTGGTAATAACAACATATTGGGGTCTACATACAGGAGAAA 
GAGACTGGCATTTGGGTCAGGGAGTCTCCATAGAATGGAGGG 
AAAGGACATATAGAACACAAGTAGACCCCGAACTAGCAGACC 
AACTAATTCATATACATTACTTTGATTGTTTTTCAGAATCTG 
CCATAAGAAGTGCCATATTAGGATATAGAGTTAGGCATAGGT 
GTGAATATCAAGCAGGACATAACAAGGTAGGATCTCTACAGT 
ACTTGGCACTAACAGCATTAA TAACACCAAAGAAGATAAAGC 
CACCTTTGCCTAGTGTTGCGAAACTGACAGAGGATAGATGGA 
ACAAGCCCCAGAAGACCAAGGGCCACAGAGGCAGCCATACAA 
TGAATGGACACTAGAACTTTTAGAGGAGCTTAAGAATGAAGC 
TGTTAGGCATTTTCCTAGGGTATGGCTCGATGGCTTAGGGCA 
ACATATCTATGAAACTTATGGGGATAGTTGGGAAGGAGTGGA • 
GGCCATAACAAGAACTCTGCAACAACTGCTGTTTATTCATTT 
. CAGAATTGGGTGTCAACATAGCAGAATAGGCATTATTCGACA 
GAGGAGAGCAAGAAATGGAGCCAGTAGATGGTAGACTAGAGC 
CCTGGAAGCATCCAGGAAGTCAGCCTAAGACTGCGTGTACCA . 
CTTGCTATTGTAAAAAGTGCTGCTTTCATTGCCAAiSTTTGTT 
TTATGACAAAAGGCTTAGGCATCTCCTATGGCAGGAAGAAGC 
GGAGACAGCGACGAAGAGCTCCTCAAGACAGTCAGACTCATC 
AAGCTTATCTATCAAAGCAGTAAGTAATATATGTAATGCAAC 
CTTTACAAATAGTAGCAATAGTAGCATTAGTAGTAGCAGGAA 
TAATAGCAATAGTTGTGTGGACCATAGTATTCATAGAATATA 
AGAAAATATTAAGACAAAGAAAAATAGACAGGTTGATTGATA 
GAATAAGAGAAAGAGCAGAAGACAGTGGCAATGACAGTGAAG 
GGGATCAGGAAGAATTATCGGCACTTGTGGACATGGGGCACC 
ATGATCCTTGGGATATTAATGATCTGTAGAGCTGCAAACAAT 
TTGTGGGTCACAGTCTATTATGGGGTACCTGTGTGGAGAGAA 
GCAACCACCACTCTATTTTGTGCATCAGATGCCAAGGCATAT 
GATGCAGAGGTACATAATGTTTGGGCCACACATGCCTGTGTA 
CCCACAGACCCTAACCCACAAGAAGTAGAATTGAAAAATGTG 
ACAGAAAATTTTAACATGTGGAAAAATAACATGGTAGAACAG 
ATGCATGAGGATATAATCAGTTTATGGGATCAAAGCCTGAAG 
C CATGTGTAAAATTAAC CC CACT CTGTGTTT CTTTAAATTGC 
ACTGATGCTACTAATACCACTAATAGTAATACCACTAGCAGC 
AGCGAGAAACCGAAGGGGACAGGGGAAATAAAAAACTGCTCT 
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TTCAATATCACCACAAGCATAAGAGATAAGGTGCAGAAACAA 

TATGCACTTTTTTATAGCCTTGATGTAGTACCAATGGATGAT 

AATGATAATAGTACAAGCTATAGGTTAATAAGTTGTAACACC 

TCAATCATTACACAGGCCTGTCCAAAGATATCCTTTGAGCCA 

ATTCCCATACATTATTGTGCCCCGGCTGGTTTTGCGATTCTA 

AAGTGTAAAGATAAAAGGTTCAATGGAAAAGGAC CATGTACA 

AGTGTCAGCACAGTACAGTGTACACATGGAATTAGGCCAGTA 

GTATCAACTCAACTGTTGTTAAATGGCAGTCTAGCAGAAGAA 

GAGGTAGTAATTAGATCTGACAATTTTACGAACAATGCTAAA 

ACCATAATAGTACAGCTGAGCAAATCTGTAGAAATTACTTGT 

GTAAGACCCAACAACAATACAAGAAAAAGTATAAGTATGGGA 

CCAGGGAGAGCATTTTATACAACAGAAATAATAGGAGATATA 

AGACAAGCATATTGTAACATTAGTAAAGCAAACTGGACTGAC 

ACTTTAGAACAGATAGCTAGAAAATTAAGAGAACAATTTGAG 

AATAAAACAATAGTCTTTAAGCCATCCTCAGGAGGGGACCCA 

GAAATTGTAACACAGTTTTACAGTTTTAATTGTGGAGGGGAA 

TTTTTGTACTGTAATTGAACACAACTGTTTAATGGTACTTGG 

AATGGTACTTGGGTTAATGGTACTTGGAGTAGTAATAATACG 

ACTGATACTGCAAATATGAGACTCCCATGCAGAATAAAACAA 

TTTATAAACATGTGGCAGGAAGTAGGAAAAGCAATGTATGCC 

CCTCCCATCAAAGGACAAATTAAATGTACATCAAATATTACA 

GGGCTGATATTAACAAGAGATGGTGGTAACAATAACACCACG 

AACGACAACGAGACCGAGACCTTCAGACCTGGAGGAGGAGAT 

ATGAGGGACAATTGGAGAAGTGAATTATATAAATATAAAGTA 

GTACAAGTTGAACCATTAGGAGTAGCACCCACCAAGGCAAAG 

AGAAGAGTGGTGCAAAGAGAAAAAAGAGCAGTGGGAATAGGA 

GCTATGTTCCTTGGGTTCTTAGGAGCAGCAGGAAGCACTATG 

GGCGCAGCGTCAGTGACGCTGACGGTACAAGCCAGACAATTA 

TTGTCTGGTATAGTGCAGCAGCAGAACAATCTGCTGAGGGCT 

ATTGAGGCGCAACAGCATCTGTTGCAACTCACAGTCTGGGGC 

ATCAAACAGCTCCAGGCAAGAGTCCTGGCTGTGGAAAGATAC 

CTAAGGGATCAACAGCTCCTGGGACTTTGGGGTTGCTCTGGA 

AAACTCATTTGCACCACTACTGTGCCTTGGAACAATAGCTGG 

AGTAATAAATCTCTGGAAACAATTTGGGATAACATGACCTGG 

ATGCAGTGGGAAAGAGAAATTGACAATTACACAAACATAATA 

TACACCTTAATTGAAGAATCGCAGAACCAACAAGAAAAAAAT 

GAACTAGAATTATTGGAATTGGATAAATGGGCAAATTTGTGG 

AATTGGTTTAGTATATCAAACTGGCTATGGTATATAAAATTA 

TTCATAATGGTAGTAGGAGGCTTGGTAGGTTTAAGAATAGTT 
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TTTACTGTACTTTCTATAGTTAATAGAGTTAGGCAGGGATAC 

TCACCATTATCGTTTCAGACCCACCTCCCAACCCCGAAGGGA 

CCCGACAGGCCAGAAGGAATCGAAGAAGAAGGTGGAGAGAGA 

GACAGAGGCAGCTCCACTCGATTAGTGCACGGATTCTTAGCA 

CTTTTCTGGGACGACCTGAGGAGTCTGTGCCTCTTCAGCTAC 

CACCACTTGAGAGACTTACTCTTGATTGTAACGAGGATTGTG 

GAACTTCTGGGACGCAGGGGATGGGAAGCCCTCAAATACTGG 

TGGAATCTCCTGCAGTATTGGAGGCAGGAACTACAGAAGAGT 

GCTGTTAGCTTGTTCAATGGCACGGCCATAGCAGTAGCTGAG 

GGGACAGATAGAGTTATAGAAGCTTTACGAAGGGCTTATAGA 

GCTATTCTCCACATACCTAGAAGAATAAGACAGGGCTTAGAA 

AGGGCTTTGCTATAAAATGGGTGGCAAGTGGTCAGAAAGTAG 

TGTGGTTAGAAGGCATGTACCTTTAAGACAAGGCAGCTATAG 

ATCTTAGCCGCTTTTTAAAAGAAAAGGGGGGACTGGAAGGGC 

TAATTCACTCACGGAAAAGACCAGTTGAACCAGAAGAAGATA 

GAAGAGGCbATGAAGAAGAAAAC^C^GATTGTTCTGCTTGC £ 

TCAGCTGGGGACTTTCCAGAAGGCGCGGCCTGAGTGACTAAG 1 

P CCCQTTGdfGGACTTTCCGAAGAGGCATGAAGGGACTTTCCA • 

AGGCAGGCGTGGCCTGGGCGGGACTGGGGAGTGGCGAGCCCT 

CAGATGCTGCATATAAGCAGCTGCTTTCTGCCTGTACTGGGT 

CTCTCTGGTTAGACCAGATCTGAGCCTGGGAGCTCTCTGGCT 

AGCTAGGGAACCCACTGCTTAAGCCTCAATAAAGCTTGCCTT 

GAGTGCTTCAAGTAGTGTGTGCCCGTC^GTTGTGTGACTCTG 

GTATCTAGAGATCCCTCAGACCATTTTAGTCCGTGTGGAAAA 
TCTCTAGCA 
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